Project: Predicting The Life time Value of Customers

Description: The Lifetime value of customers is given as the ratio of Cumulative Net Presnt Value and the total customers in the cohort. For our analysis, we shall use machine learning clustering algorith to create the cohorts and then use classification to classify the new customers into their respective cohorts and compute the LTV.

Importing Libraries

In [1]:
import numpy as np
import pandas as pd
import io
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.style as style
import matplotlib
import seaborn as sns
import datetime as dt
from IPython.display import Image
from datetime import datetime
from dateutil import relativedelta
import dill
from mpl_toolkits.mplot3d import Axes3D
import plotly.graph_objs as go
import plotly as py
import warnings
warnings.filterwarnings('ignore')
py.offline.init_notebook_mode(connected = True)
In [2]:
style.use('seaborn-darkgrid') #sets the size of the charts
style.use('ggplot')
matplotlib.rcParams['font.family'] = "serif"
sns.set_context('talk')  

Importing Dataset

In [3]:
df = pd.read_csv("fact_bill_ltv_smry_data_0821_Cust.txt", sep = '|')
In [4]:
df.head()
Out[4]:
Unnamed: 0 525298014 17/03/04 0 719 7 904.34 0.1 1 1.1 ... 646.29 613.33 32.96 9.29 .83 .00 .34 .28 .00.1 .00
0 NaN 925381920 17/07/22 1 51 25 8012.49 5 0 5 ... 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 NaN 825437339 17/11/01 1 51 21 5977.00 0 0 3 ... 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 NaN 689017953 17/10/14 1 51 22 5842.94 0 0 4 ... 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 NaN 373396651 17/11/27 1 51 21 6018.78 0 0 2 ... 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 NaN 588971563 17/06/19 1 51 26 7135.08 4 0 4 ... 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0

5 rows × 55 columns

Sample details of first customer in the Database:

In [5]:
df.columns = ['sno','cust_id', 'cust_estb_dt', 'cust_status_ind', 'recency', 'frequency', 'monetary', 'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt', 'billed_acct_size', 'VoiceTextData_pplangrp_cnt', 'VoiceData_revtype_cnt', 'active_loan_cnt', 'birth_year', 'zip5_cd', 'Loand_equip_cnt', 'BeyondUnlmtd_plan_cnt', 'GoUnlimit_plan_cnt', 'IphoneDev_cnt', 'recurring_paychnl_cnt', 'CA_state_cd', 'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt', 'Edge_contract_cnt', 'CommunicStore_chnl_cnt', 'AndroidDev_cnt', 'WearablDev_cnt', 'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt', 'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt', 'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt', 'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt', 'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt', 'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt', 'gl_tot_service_rev_amt', 'gl_recur_rev_amt', 'gl_nonrecur_rev_amt', 'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn', 'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt', 'voice_overage_amt']
df = df.drop('sno', axis=1)
In [6]:
df.head()
Out[6]:
cust_id cust_estb_dt cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size VoiceTextData_pplangrp_cnt ... gl_tot_service_rev_amt gl_recur_rev_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt
0 925381920 17/07/22 1 51 25 8012.49 5 0 5 5 ... 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 825437339 17/11/01 1 51 21 5977.00 0 0 3 3 ... 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 689017953 17/10/14 1 51 22 5842.94 0 0 4 4 ... 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 373396651 17/11/27 1 51 21 6018.78 0 0 2 2 ... 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 588971563 17/06/19 1 51 26 7135.08 4 0 4 4 ... 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0

5 rows × 54 columns

In [7]:
df.columns
Out[7]:
Index(['cust_id', 'cust_estb_dt', 'cust_status_ind', 'recency', 'frequency',
       'monetary', 'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt',
       'billed_acct_size', 'VoiceTextData_pplangrp_cnt',
       'VoiceData_revtype_cnt', 'active_loan_cnt', 'birth_year', 'zip5_cd',
       'Loand_equip_cnt', 'BeyondUnlmtd_plan_cnt', 'GoUnlimit_plan_cnt',
       'IphoneDev_cnt', 'recurring_paychnl_cnt', 'CA_state_cd',
       'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt', 'Edge_contract_cnt',
       'CommunicStore_chnl_cnt', 'AndroidDev_cnt', 'WearablDev_cnt',
       'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt',
       'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt',
       'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt',
       'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt',
       'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt',
       'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt',
       'gl_tot_service_rev_amt', 'gl_recur_rev_amt', 'gl_nonrecur_rev_amt',
       'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn',
       'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt',
       'voice_overage_amt'],
      dtype='object')
In [8]:
len(df)
Out[8]:
758196
In [9]:
df.isnull().sum()
Out[9]:
cust_id                       0
cust_estb_dt                  0
cust_status_ind               0
recency                       0
frequency                     0
monetary                      0
NewVZPlan_plan_cnt            0
VerizonUNL_plan_cnt           0
billed_acct_size              0
VoiceTextData_pplangrp_cnt    0
VoiceData_revtype_cnt         0
active_loan_cnt               0
birth_year                    0
zip5_cd                       0
Loand_equip_cnt               0
BeyondUnlmtd_plan_cnt         0
GoUnlimit_plan_cnt            0
IphoneDev_cnt                 0
recurring_paychnl_cnt         0
CA_state_cd                   0
Hispanic_ethnicity_cnt        0
data_ovrg_Yes_cnt             0
Edge_contract_cnt             0
CommunicStore_chnl_cnt        0
AndroidDev_cnt                0
WearablDev_cnt                0
SingleLine_plan_cnt           0
TX_state_cd                   0
ATT_port_cnt                  0
DataOnly_Revtype_cnt          0
MatureAdults_cnt              0
handset_paychnl_cnt           0
VoiceOnly_pplangrp_cnt        0
SngleCpleKids_cnt             0
FamTeens_cnt                  0
Purch_equip_cnt               0
MTM_contract_cnt              0
T_Mobile_port_cnt             0
Sprint_port_cnt               0
LocalRetailer_chnl_cnt        0
TwoYear_contract_cnt          0
BasicDev_cnt                  0
arpu_amt                      0
gl_late_fees_amt              0
gl_tot_service_rev_amt        0
gl_recur_rev_amt              0
gl_nonrecur_rev_amt           0
pplan_access_acrtn            0
tot_revenue_acrtn             0
access_rev_acrtn              0
eqp_rev_acrtn                 0
usage_rev_acrtn               0
data_overage_amt              0
voice_overage_amt             0
dtype: int64

There are no Null values in the dataset.

In [10]:
df.head()
Out[10]:
cust_id cust_estb_dt cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size VoiceTextData_pplangrp_cnt ... gl_tot_service_rev_amt gl_recur_rev_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt
0 925381920 17/07/22 1 51 25 8012.49 5 0 5 5 ... 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 825437339 17/11/01 1 51 21 5977.00 0 0 3 3 ... 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 689017953 17/10/14 1 51 22 5842.94 0 0 4 4 ... 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 373396651 17/11/27 1 51 21 6018.78 0 0 2 2 ... 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 588971563 17/06/19 1 51 26 7135.08 4 0 4 4 ... 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0

5 rows × 54 columns

In [11]:
NOW = dt.datetime(2019,8,1) #Today's date (We take Aug 1, as we have data only until july 31)

Where are our customers located?

In [12]:
df_zip = pd.read_csv('us-zip-code-latitude-and-longitude.csv', sep=';')

Outlier Removal

In [13]:
df.describe()
Out[13]:
cust_id cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size VoiceTextData_pplangrp_cnt VoiceData_revtype_cnt ... gl_tot_service_rev_amt gl_recur_rev_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt
count 7.581960e+05 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 ... 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000 758196.000000
mean 6.105298e+08 0.699624 197.979358 20.469335 3598.274701 0.785815 0.987718 2.305512 2.323062 2.151357 ... 2293.113338 2194.204795 98.908543 9.599218 -13.440882 -2.575634 -2.685904 0.393704 1.164720 0.403131
std 2.304322e+08 0.458422 251.255878 8.726280 2740.704143 1.143805 1.751314 1.697620 1.752230 1.577893 ... 1595.544941 1543.570243 217.113144 26.270767 40.332941 8.270055 20.061945 14.348179 10.694771 8.911295
min 2.252509e+08 0.000000 51.000000 1.000000 -10728.090000 0.000000 0.000000 0.000000 0.000000 0.000000 ... -7540.980000 -5443.390000 -9145.000000 -120.000000 -5070.900000 -355.340000 -5874.740000 -1315.370000 -0.040000 0.000000
25% 4.252560e+08 0.000000 51.000000 17.000000 1665.400000 0.000000 0.000000 1.000000 1.000000 1.000000 ... 1200.707500 1127.955000 5.000000 0.000000 -13.020000 -2.410000 -1.870000 0.000000 0.000000 0.000000
50% 6.252509e+08 1.000000 51.000000 23.000000 3079.480000 0.000000 0.000000 2.000000 2.000000 2.000000 ... 2072.735000 1985.830000 30.990000 2.260000 -4.710000 0.000000 -0.030000 0.000000 0.000000 0.000000
75% 7.890506e+08 1.000000 293.000000 27.000000 4999.102500 1.000000 1.000000 3.000000 3.000000 3.000000 ... 3197.330000 3070.000000 119.500000 8.570000 -1.230000 0.000000 0.000000 0.000000 0.000000 0.000000
max 9.890513e+08 1.000000 962.000000 74.000000 95900.620000 30.000000 24.000000 22.000000 30.000000 28.000000 ... 29032.510000 24626.960000 27599.570000 1893.330000 5969.420000 370.000000 2156.840000 4728.000000 2115.000000 4020.340000

8 rows × 53 columns

In [14]:
# Outlier Removal:

def outlier_treatment(datacolumn):
 sorted(datacolumn)
 Q1,Q3 = np.percentile(datacolumn , [25,75])
 IQR = Q3 - Q1
 lower_range = Q1 - (1.5 * IQR)
 upper_range = Q3 + (1.5 * IQR)
 return lower_range,upper_range

print("Outlier boundaries for Monetary Value")
print(outlier_treatment(df['monetary']))
Outlier boundaries for Monetary Value
(-3335.15375, 9999.65625)
In [15]:
df = df[df.monetary > -3335.15375]
df = df[df.monetary < 9999.65625]
In [16]:
len(df)
Out[16]:
735831

We now have 735,831 customers after outlier removal!

Removing unwanted columns:

In [17]:
df.columns
Out[17]:
Index(['cust_id', 'cust_estb_dt', 'cust_status_ind', 'recency', 'frequency',
       'monetary', 'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt',
       'billed_acct_size', 'VoiceTextData_pplangrp_cnt',
       'VoiceData_revtype_cnt', 'active_loan_cnt', 'birth_year', 'zip5_cd',
       'Loand_equip_cnt', 'BeyondUnlmtd_plan_cnt', 'GoUnlimit_plan_cnt',
       'IphoneDev_cnt', 'recurring_paychnl_cnt', 'CA_state_cd',
       'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt', 'Edge_contract_cnt',
       'CommunicStore_chnl_cnt', 'AndroidDev_cnt', 'WearablDev_cnt',
       'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt',
       'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt',
       'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt',
       'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt',
       'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt',
       'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt',
       'gl_tot_service_rev_amt', 'gl_recur_rev_amt', 'gl_nonrecur_rev_amt',
       'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn',
       'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt',
       'voice_overage_amt'],
      dtype='object')

Customer Id seems to be the only column that can be removed as it has only unique values.

In [18]:
df = df.drop(['cust_id'], axis = 1)
In [19]:
df = df.drop(['cust_estb_dt'], axis = 1)

Pandas Profiling

In [20]:
import pandas_profiling
profile = pandas_profiling.ProfileReport(df)
In [21]:
profile
Out[21]:

Overview

Dataset info

Number of variables 53
Number of observations 735831
Total Missing (%) 0.0%
Total size in memory 297.5 MiB
Average record size in memory 424.0 B

Variables types

Numeric 47
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 5
Unsupported 0

Warnings

Variables

ATT_port_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.36437
Minimum 0
Maximum 11
Zeros (%) 78.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 11
Range 11
Interquartile range 0

Descriptive statistics

Standard deviation 0.82867
Coef of variation 2.2742
Kurtosis 9.8599
Mean 0.36437
MAD 0.56934
Skewness 2.8866
Sum 268115
Variance 0.68669
Memory size 5.6 MiB
Value Count Frequency (%)  
0 574875 78.1%
 
1 94054 12.8%
 
2 41538 5.6%
 
3 14285 1.9%
 
4 8184 1.1%
 
5 2229 0.3%
 
6 503 0.1%
 
7 107 0.0%
 
8 33 0.0%
 
9 14 0.0%
 
Other values (2) 9 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 574875 78.1%
 
1 94054 12.8%
 
2 41538 5.6%
 
3 14285 1.9%
 
4 8184 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 107 0.0%
 
8 33 0.0%
 
9 14 0.0%
 
10 7 0.0%
 
11 2 0.0%
 

AndroidDev_cnt
Numeric

Distinct count 17
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.67345
Minimum 0
Maximum 27
Zeros (%) 56.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 27
Range 27
Interquartile range 1

Descriptive statistics

Standard deviation 0.96866
Coef of variation 1.4383
Kurtosis 5.5411
Mean 0.67345
MAD 0.7599
Skewness 1.8993
Sum 495549
Variance 0.93831
Memory size 5.6 MiB
Value Count Frequency (%)  
0 415139 56.4%
 
1 205407 27.9%
 
2 76584 10.4%
 
3 23852 3.2%
 
4 10626 1.4%
 
5 3018 0.4%
 
6 842 0.1%
 
7 241 0.0%
 
8 73 0.0%
 
9 25 0.0%
 
Other values (7) 24 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 415139 56.4%
 
1 205407 27.9%
 
2 76584 10.4%
 
3 23852 3.2%
 
4 10626 1.4%
 

Maximum 5 values

Value Count Frequency (%)  
12 1 0.0%
 
13 1 0.0%
 
14 1 0.0%
 
15 1 0.0%
 
27 1 0.0%
 

BasicDev_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.058897
Minimum 0
Maximum 10
Zeros (%) 94.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.2708
Coef of variation 4.5979
Kurtosis 55.757
Mean 0.058897
MAD 0.11169
Skewness 5.9257
Sum 43338
Variance 0.073334
Memory size 5.6 MiB
Value Count Frequency (%)  
0 697716 94.8%
 
1 33772 4.6%
 
2 3750 0.5%
 
3 407 0.1%
 
4 130 0.0%
 
5 37 0.0%
 
6 9 0.0%
 
9 4 0.0%
 
10 2 0.0%
 
8 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 697716 94.8%
 
1 33772 4.6%
 
2 3750 0.5%
 
3 407 0.1%
 
4 130 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
6 9 0.0%
 
7 2 0.0%
 
8 2 0.0%
 
9 4 0.0%
 
10 2 0.0%
 

BeyondUnlmtd_plan_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.32247
Minimum 0
Maximum 16
Zeros (%) 86.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 16
Range 16
Interquartile range 0

Descriptive statistics

Standard deviation 0.98385
Coef of variation 3.051
Kurtosis 16.999
Mean 0.32247
MAD 0.55668
Skewness 3.8353
Sum 237285
Variance 0.96796
Memory size 5.6 MiB
Value Count Frequency (%)  
0 635124 86.3%
 
1 37637 5.1%
 
2 27093 3.7%
 
3 15185 2.1%
 
4 10974 1.5%
 
5 5633 0.8%
 
6 2510 0.3%
 
7 1032 0.1%
 
8 384 0.1%
 
9 156 0.0%
 
Other values (6) 103 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 635124 86.3%
 
1 37637 5.1%
 
2 27093 3.7%
 
3 15185 2.1%
 
4 10974 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
11 24 0.0%
 
12 8 0.0%
 
13 2 0.0%
 
14 1 0.0%
 
16 1 0.0%
 

CA_state_cd
Numeric

Distinct count 19
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1124
Minimum 0
Maximum 21
Zeros (%) 51.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 21
Range 21
Interquartile range 2

Descriptive statistics

Standard deviation 1.5761
Coef of variation 1.4169
Kurtosis 3.6899
Mean 1.1124
MAD 1.1894
Skewness 1.7942
Sum 818526
Variance 2.4842
Memory size 5.6 MiB
Value Count Frequency (%)  
0 378469 51.4%
 
1 147546 20.1%
 
2 91578 12.4%
 
3 48715 6.6%
 
4 34552 4.7%
 
5 18822 2.6%
 
6 9148 1.2%
 
7 3950 0.5%
 
8 1687 0.2%
 
9 728 0.1%
 
Other values (9) 636 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 378469 51.4%
 
1 147546 20.1%
 
2 91578 12.4%
 
3 48715 6.6%
 
4 34552 4.7%
 

Maximum 5 values

Value Count Frequency (%)  
14 10 0.0%
 
15 4 0.0%
 
17 1 0.0%
 
19 1 0.0%
 
21 1 0.0%
 

CommunicStore_chnl_cnt
Numeric

Distinct count 19
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.0009
Minimum 0
Maximum 24
Zeros (%) 50.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 24
Range 24
Interquartile range 2

Descriptive statistics

Standard deviation 1.3845
Coef of variation 1.3832
Kurtosis 3.9193
Mean 1.0009
MAD 1.0153
Skewness 1.7908
Sum 736488
Variance 1.9167
Memory size 5.6 MiB
Value Count Frequency (%)  
0 373049 50.7%
 
1 172145 23.4%
 
2 95128 12.9%
 
3 44792 6.1%
 
4 28506 3.9%
 
5 13311 1.8%
 
6 5468 0.7%
 
7 2070 0.3%
 
8 811 0.1%
 
9 308 0.0%
 
Other values (9) 243 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 373049 50.7%
 
1 172145 23.4%
 
2 95128 12.9%
 
3 44792 6.1%
 
4 28506 3.9%
 

Maximum 5 values

Value Count Frequency (%)  
14 1 0.0%
 
16 2 0.0%
 
17 1 0.0%
 
19 1 0.0%
 
24 1 0.0%
 

DataOnly_Revtype_cnt
Numeric

Distinct count 15
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.36147
Minimum 0
Maximum 17
Zeros (%) 72.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 17
Range 17
Interquartile range 1

Descriptive statistics

Standard deviation 0.67594
Coef of variation 1.87
Kurtosis 11.918
Mean 0.36147
MAD 0.52104
Skewness 2.5836
Sum 265982
Variance 0.45689
Memory size 5.6 MiB
Value Count Frequency (%)  
0 530327 72.1%
 
1 159953 21.7%
 
2 35191 4.8%
 
3 7472 1.0%
 
4 1935 0.3%
 
5 608 0.1%
 
6 166 0.0%
 
7 85 0.0%
 
8 41 0.0%
 
10 27 0.0%
 
Other values (5) 26 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 530327 72.1%
 
1 159953 21.7%
 
2 35191 4.8%
 
3 7472 1.0%
 
4 1935 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
10 27 0.0%
 
11 2 0.0%
 
12 2 0.0%
 
14 2 0.0%
 
17 1 0.0%
 

Edge_contract_cnt
Highly correlated

This variable is highly correlated with Loand_equip_cnt and should be ignored for analysis

Correlation 0.91842

FamTeens_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.6346
Minimum 0
Maximum 17
Zeros (%) 71.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 17
Range 17
Interquartile range 1

Descriptive statistics

Standard deviation 1.2845
Coef of variation 2.0241
Kurtosis 7.044
Mean 0.6346
MAD 0.91163
Skewness 2.516
Sum 466955
Variance 1.65
Memory size 5.6 MiB
Value Count Frequency (%)  
0 528533 71.8%
 
1 86797 11.8%
 
2 52222 7.1%
 
3 28654 3.9%
 
4 21133 2.9%
 
5 10653 1.4%
 
6 4817 0.7%
 
7 1863 0.3%
 
8 711 0.1%
 
9 261 0.0%
 
Other values (6) 187 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 528533 71.8%
 
1 86797 11.8%
 
2 52222 7.1%
 
3 28654 3.9%
 
4 21133 2.9%
 

Maximum 5 values

Value Count Frequency (%)  
11 37 0.0%
 
12 14 0.0%
 
13 7 0.0%
 
14 3 0.0%
 
17 1 0.0%
 

GoUnlimit_plan_cnt
Numeric

Distinct count 18
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.75149
Minimum 0
Maximum 17
Zeros (%) 69.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 17
Range 17
Interquartile range 1

Descriptive statistics

Standard deviation 1.4114
Coef of variation 1.8782
Kurtosis 5.1648
Mean 0.75149
MAD 1.0446
Skewness 2.2066
Sum 552967
Variance 1.9921
Memory size 5.6 MiB
Value Count Frequency (%)  
0 511395 69.5%
 
1 73277 10.0%
 
2 64383 8.7%
 
3 36281 4.9%
 
4 27135 3.7%
 
5 13221 1.8%
 
6 6142 0.8%
 
7 2398 0.3%
 
8 990 0.1%
 
9 367 0.0%
 
Other values (8) 242 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 511395 69.5%
 
1 73277 10.0%
 
2 64383 8.7%
 
3 36281 4.9%
 
4 27135 3.7%
 

Maximum 5 values

Value Count Frequency (%)  
13 7 0.0%
 
14 1 0.0%
 
15 3 0.0%
 
16 1 0.0%
 
17 1 0.0%
 

Hispanic_ethnicity_cnt
Numeric

Distinct count 17
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.68787
Minimum 0
Maximum 16
Zeros (%) 72.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 16
Range 16
Interquartile range 1

Descriptive statistics

Standard deviation 1.3956
Coef of variation 2.0288
Kurtosis 6.6777
Mean 0.68787
MAD 0.99079
Skewness 2.4751
Sum 506158
Variance 1.9476
Memory size 5.6 MiB
Value Count Frequency (%)  
0 529934 72.0%
 
1 75457 10.3%
 
2 51985 7.1%
 
3 30660 4.2%
 
4 23346 3.2%
 
5 13282 1.8%
 
6 6462 0.9%
 
7 2780 0.4%
 
8 1128 0.2%
 
9 473 0.1%
 
Other values (7) 324 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 529934 72.0%
 
1 75457 10.3%
 
2 51985 7.1%
 
3 30660 4.2%
 
4 23346 3.2%
 

Maximum 5 values

Value Count Frequency (%)  
12 37 0.0%
 
13 10 0.0%
 
14 1 0.0%
 
15 4 0.0%
 
16 2 0.0%
 

IphoneDev_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.2057
Minimum 0
Maximum 17
Zeros (%) 31.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 2
95-th percentile 4
Maximum 17
Range 17
Interquartile range 2

Descriptive statistics

Standard deviation 1.2238
Coef of variation 1.015
Kurtosis 2.8246
Mean 1.2057
MAD 0.91866
Skewness 1.4101
Sum 887159
Variance 1.4977
Memory size 5.6 MiB
Value Count Frequency (%)  
1 279648 38.0%
 
0 232634 31.6%
 
2 128904 17.5%
 
3 49999 6.8%
 
4 30340 4.1%
 
5 9976 1.4%
 
6 2859 0.4%
 
7 868 0.1%
 
8 350 0.0%
 
9 138 0.0%
 
Other values (6) 115 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 232634 31.6%
 
1 279648 38.0%
 
2 128904 17.5%
 
3 49999 6.8%
 
4 30340 4.1%
 

Maximum 5 values

Value Count Frequency (%)  
11 14 0.0%
 
12 6 0.0%
 
13 1 0.0%
 
14 2 0.0%
 
17 1 0.0%
 

Loand_equip_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.574
Minimum 0
Maximum 15
Zeros (%) 18.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 2
95-th percentile 4
Maximum 15
Range 15
Interquartile range 1

Descriptive statistics

Standard deviation 1.3306
Coef of variation 0.84533
Kurtosis 1.8558
Mean 1.574
MAD 1.0456
Skewness 1.213
Sum 1158227
Variance 1.7705
Memory size 5.6 MiB
Value Count Frequency (%)  
1 293479 39.9%
 
2 161993 22.0%
 
0 137369 18.7%
 
3 70491 9.6%
 
4 45756 6.2%
 
5 18089 2.5%
 
6 5961 0.8%
 
7 1843 0.3%
 
8 603 0.1%
 
9 176 0.0%
 
Other values (6) 71 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 137369 18.7%
 
1 293479 39.9%
 
2 161993 22.0%
 
3 70491 9.6%
 
4 45756 6.2%
 

Maximum 5 values

Value Count Frequency (%)  
11 7 0.0%
 
12 4 0.0%
 
13 2 0.0%
 
14 1 0.0%
 
15 2 0.0%
 

LocalRetailer_chnl_cnt
Numeric

Distinct count 17
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.40163
Minimum 0
Maximum 16
Zeros (%) 79.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 16
Range 16
Interquartile range 0

Descriptive statistics

Standard deviation 0.98021
Coef of variation 2.4406
Kurtosis 13.174
Mean 0.40163
MAD 0.63809
Skewness 3.2519
Sum 295529
Variance 0.96082
Memory size 5.6 MiB
Value Count Frequency (%)  
0 584534 79.4%
 
1 74742 10.2%
 
2 40181 5.5%
 
3 18053 2.5%
 
4 10596 1.4%
 
5 4572 0.6%
 
6 1893 0.3%
 
7 789 0.1%
 
8 266 0.0%
 
9 111 0.0%
 
Other values (7) 94 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 584534 79.4%
 
1 74742 10.2%
 
2 40181 5.5%
 
3 18053 2.5%
 
4 10596 1.4%
 

Maximum 5 values

Value Count Frequency (%)  
12 9 0.0%
 
13 3 0.0%
 
14 4 0.0%
 
15 1 0.0%
 
16 1 0.0%
 

MTM_contract_cnt
Numeric

Distinct count 20
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.4385
Minimum 0
Maximum 30
Zeros (%) 16.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 2
95-th percentile 4
Maximum 30
Range 30
Interquartile range 1

Descriptive statistics

Standard deviation 1.1824
Coef of variation 0.82195
Kurtosis 4.1995
Mean 1.4385
MAD 0.89743
Skewness 1.5183
Sum 1058487
Variance 1.398
Memory size 5.6 MiB
Value Count Frequency (%)  
1 356517 48.5%
 
2 150910 20.5%
 
0 120856 16.4%
 
3 55321 7.5%
 
4 35591 4.8%
 
5 11443 1.6%
 
6 3259 0.4%
 
7 1103 0.1%
 
8 459 0.1%
 
9 187 0.0%
 
Other values (10) 185 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 120856 16.4%
 
1 356517 48.5%
 
2 150910 20.5%
 
3 55321 7.5%
 
4 35591 4.8%
 

Maximum 5 values

Value Count Frequency (%)  
15 1 0.0%
 
16 2 0.0%
 
17 1 0.0%
 
19 1 0.0%
 
30 1 0.0%
 

MatureAdults_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.40659
Minimum 0
Maximum 21
Zeros (%) 80.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 3
Maximum 21
Range 21
Interquartile range 0

Descriptive statistics

Standard deviation 1.0067
Coef of variation 2.4759
Kurtosis 13.114
Mean 0.40659
MAD 0.65024
Skewness 3.2615
Sum 299181
Variance 1.0134
Memory size 5.6 MiB
Value Count Frequency (%)  
0 588393 80.0%
 
1 68866 9.4%
 
2 39670 5.4%
 
3 19318 2.6%
 
4 10937 1.5%
 
5 5026 0.7%
 
6 2187 0.3%
 
7 868 0.1%
 
8 328 0.0%
 
9 132 0.0%
 
Other values (6) 106 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 588393 80.0%
 
1 68866 9.4%
 
2 39670 5.4%
 
3 19318 2.6%
 
4 10937 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
11 26 0.0%
 
12 11 0.0%
 
13 3 0.0%
 
15 1 0.0%
 
21 1 0.0%
 

NewVZPlan_plan_cnt
Numeric

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.78961
Minimum 0
Maximum 30
Zeros (%) 55.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 30
Range 30
Interquartile range 1

Descriptive statistics

Standard deviation 1.1239
Coef of variation 1.4234
Kurtosis 5.0266
Mean 0.78961
MAD 0.86789
Skewness 1.846
Sum 581016
Variance 1.2632
Memory size 5.6 MiB
Value Count Frequency (%)  
0 404390 55.0%
 
1 178000 24.2%
 
2 94174 12.8%
 
3 35554 4.8%
 
4 15311 2.1%
 
5 5400 0.7%
 
6 1936 0.3%
 
7 685 0.1%
 
8 219 0.0%
 
9 83 0.0%
 
Other values (6) 79 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 404390 55.0%
 
1 178000 24.2%
 
2 94174 12.8%
 
3 35554 4.8%
 
4 15311 2.1%
 

Maximum 5 values

Value Count Frequency (%)  
11 17 0.0%
 
12 4 0.0%
 
13 3 0.0%
 
14 2 0.0%
 
30 1 0.0%
 

Purch_equip_cnt
Numeric

Distinct count 19
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.61292
Minimum 0
Maximum 19
Zeros (%) 58.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 19
Range 19
Interquartile range 1

Descriptive statistics

Standard deviation 0.92301
Coef of variation 1.5059
Kurtosis 10.053
Mean 0.61292
MAD 0.7104
Skewness 2.3773
Sum 451004
Variance 0.85194
Memory size 5.6 MiB
Value Count Frequency (%)  
0 426430 58.0%
 
1 217683 29.6%
 
2 61436 8.3%
 
3 18690 2.5%
 
4 7158 1.0%
 
5 2595 0.4%
 
6 987 0.1%
 
7 413 0.1%
 
8 197 0.0%
 
9 120 0.0%
 
Other values (9) 122 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 426430 58.0%
 
1 217683 29.6%
 
2 61436 8.3%
 
3 18690 2.5%
 
4 7158 1.0%
 

Maximum 5 values

Value Count Frequency (%)  
14 2 0.0%
 
15 2 0.0%
 
16 1 0.0%
 
17 1 0.0%
 
19 1 0.0%
 

SingleLine_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.062683
Minimum 0
Maximum 13
Zeros (%) 94.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 0.28311
Coef of variation 4.5166
Kurtosis 72.985
Mean 0.062683
MAD 0.11853
Skewness 6.2361
Sum 46124
Variance 0.080153
Memory size 5.6 MiB
Value Count Frequency (%)  
0 695686 94.5%
 
1 35281 4.8%
 
2 4121 0.6%
 
3 536 0.1%
 
4 129 0.0%
 
5 50 0.0%
 
6 10 0.0%
 
10 5 0.0%
 
9 5 0.0%
 
7 5 0.0%
 
Other values (2) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 695686 94.5%
 
1 35281 4.8%
 
2 4121 0.6%
 
3 536 0.1%
 
4 129 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
7 5 0.0%
 
9 5 0.0%
 
10 5 0.0%
 
12 2 0.0%
 
13 1 0.0%
 

SngleCpleKids_cnt
Numeric

Distinct count 19
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.90318
Minimum 0
Maximum 19
Zeros (%) 60.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 19
Range 19
Interquartile range 1

Descriptive statistics

Standard deviation 1.4724
Coef of variation 1.6303
Kurtosis 4.8645
Mean 0.90318
MAD 1.0936
Skewness 2.0471
Sum 664591
Variance 2.168
Memory size 5.6 MiB
Value Count Frequency (%)  
0 445479 60.5%
 
1 117122 15.9%
 
2 76371 10.4%
 
3 40909 5.6%
 
4 28976 3.9%
 
5 14976 2.0%
 
6 6804 0.9%
 
7 3006 0.4%
 
8 1183 0.2%
 
9 516 0.1%
 
Other values (9) 489 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 445479 60.5%
 
1 117122 15.9%
 
2 76371 10.4%
 
3 40909 5.6%
 
4 28976 3.9%
 

Maximum 5 values

Value Count Frequency (%)  
14 7 0.0%
 
15 6 0.0%
 
16 1 0.0%
 
17 2 0.0%
 
19 3 0.0%
 

Sprint_port_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.22711
Minimum 0
Maximum 10
Zeros (%) 86.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.68796
Coef of variation 3.0291
Kurtosis 17.303
Mean 0.22711
MAD 0.39386
Skewness 3.8472
Sum 167117
Variance 0.47329
Memory size 5.6 MiB
Value Count Frequency (%)  
0 638032 86.7%
 
1 55699 7.6%
 
2 24865 3.4%
 
3 9606 1.3%
 
4 5766 0.8%
 
5 1501 0.2%
 
6 272 0.0%
 
7 64 0.0%
 
8 16 0.0%
 
9 7 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 638032 86.7%
 
1 55699 7.6%
 
2 24865 3.4%
 
3 9606 1.3%
 
4 5766 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
6 272 0.0%
 
7 64 0.0%
 
8 16 0.0%
 
9 7 0.0%
 
10 3 0.0%
 

TX_state_cd
Numeric

Distinct count 19
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.66742
Minimum 0
Maximum 30
Zeros (%) 71.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 30
Range 30
Interquartile range 1

Descriptive statistics

Standard deviation 1.3301
Coef of variation 1.9929
Kurtosis 7.1497
Mean 0.66742
MAD 0.95329
Skewness 2.4742
Sum 491106
Variance 1.7692
Memory size 5.6 MiB
Value Count Frequency (%)  
0 525502 71.4%
 
1 80024 10.9%
 
2 56759 7.7%
 
3 31848 4.3%
 
4 21562 2.9%
 
5 11246 1.5%
 
6 5216 0.7%
 
7 2191 0.3%
 
8 870 0.1%
 
9 353 0.0%
 
Other values (9) 260 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 525502 71.4%
 
1 80024 10.9%
 
2 56759 7.7%
 
3 31848 4.3%
 
4 21562 2.9%
 

Maximum 5 values

Value Count Frequency (%)  
14 4 0.0%
 
15 1 0.0%
 
16 2 0.0%
 
19 2 0.0%
 
30 1 0.0%
 

T_Mobile_port_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.24391
Minimum 0
Maximum 18
Zeros (%) 85.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 18
Range 18
Interquartile range 0

Descriptive statistics

Standard deviation 0.71179
Coef of variation 2.9183
Kurtosis 18.507
Mean 0.24391
MAD 0.41713
Skewness 3.857
Sum 179475
Variance 0.50664
Memory size 5.6 MiB
Value Count Frequency (%)  
0 629201 85.5%
 
1 63124 8.6%
 
2 25793 3.5%
 
3 9330 1.3%
 
4 6087 0.8%
 
5 1674 0.2%
 
6 420 0.1%
 
7 125 0.0%
 
8 51 0.0%
 
9 14 0.0%
 
Other values (2) 12 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 629201 85.5%
 
1 63124 8.6%
 
2 25793 3.5%
 
3 9330 1.3%
 
4 6087 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
7 125 0.0%
 
8 51 0.0%
 
9 14 0.0%
 
10 11 0.0%
 
18 1 0.0%
 

TwoYear_contract_cnt
Numeric

Distinct count 15
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.26292
Minimum 0
Maximum 15
Zeros (%) 78.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 15
Range 15
Interquartile range 0

Descriptive statistics

Standard deviation 0.56849
Coef of variation 2.1622
Kurtosis 14.03
Mean 0.26292
MAD 0.41344
Skewness 2.8512
Sum 193467
Variance 0.32318
Memory size 5.6 MiB
Value Count Frequency (%)  
0 578532 78.6%
 
1 129010 17.5%
 
2 22615 3.1%
 
3 4169 0.6%
 
4 1086 0.1%
 
5 286 0.0%
 
6 72 0.0%
 
7 28 0.0%
 
8 11 0.0%
 
10 9 0.0%
 
Other values (5) 13 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 578532 78.6%
 
1 129010 17.5%
 
2 22615 3.1%
 
3 4169 0.6%
 
4 1086 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
10 9 0.0%
 
11 1 0.0%
 
13 1 0.0%
 
14 1 0.0%
 
15 2 0.0%
 

VerizonUNL_plan_cnt
Numeric

Distinct count 17
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.84847
Minimum 0
Maximum 19
Zeros (%) 63.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 19
Range 19
Interquartile range 1

Descriptive statistics

Standard deviation 1.4702
Coef of variation 1.7328
Kurtosis 5.6163
Mean 0.84847
MAD 1.0718
Skewness 2.2134
Sum 624327
Variance 2.1615
Memory size 5.6 MiB
Value Count Frequency (%)  
0 464739 63.2%
 
1 114523 15.6%
 
2 64972 8.8%
 
3 37724 5.1%
 
4 25932 3.5%
 
5 15053 2.0%
 
6 7148 1.0%
 
7 3197 0.4%
 
8 1347 0.2%
 
9 619 0.1%
 
Other values (7) 577 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 464739 63.2%
 
1 114523 15.6%
 
2 64972 8.8%
 
3 37724 5.1%
 
4 25932 3.5%
 

Maximum 5 values

Value Count Frequency (%)  
12 47 0.0%
 
13 9 0.0%
 
14 7 0.0%
 
15 2 0.0%
 
19 3 0.0%
 

VoiceData_revtype_cnt
Highly correlated

This variable is highly correlated with VoiceTextData_pplangrp_cnt and should be ignored for analysis

Correlation 0.92659

VoiceOnly_pplangrp_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.059159
Minimum 0
Maximum 12
Zeros (%) 94.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 12
Range 12
Interquartile range 0

Descriptive statistics

Standard deviation 0.27487
Coef of variation 4.6463
Kurtosis 73.018
Mean 0.059159
MAD 0.11223
Skewness 6.3147
Sum 43531
Variance 0.075552
Memory size 5.6 MiB
Value Count Frequency (%)  
0 697982 94.9%
 
1 33133 4.5%
 
2 4060 0.6%
 
3 487 0.1%
 
4 108 0.0%
 
5 38 0.0%
 
9 6 0.0%
 
6 6 0.0%
 
10 5 0.0%
 
7 3 0.0%
 
Other values (2) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 697982 94.9%
 
1 33133 4.5%
 
2 4060 0.6%
 
3 487 0.1%
 
4 108 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
7 3 0.0%
 
9 6 0.0%
 
10 5 0.0%
 
11 2 0.0%
 
12 1 0.0%
 

VoiceTextData_pplangrp_cnt
Highly correlated

This variable is highly correlated with billed_acct_size and should be ignored for analysis

Correlation 0.94893

WearablDev_cnt
Numeric

Distinct count 9
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.1311
Minimum 0
Maximum 8
Zeros (%) 89.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 8
Range 8
Interquartile range 0

Descriptive statistics

Standard deviation 0.42203
Coef of variation 3.219
Kurtosis 18.885
Mean 0.1311
MAD 0.23485
Skewness 3.8806
Sum 96471
Variance 0.17811
Memory size 5.6 MiB
Value Count Frequency (%)  
0 659065 89.6%
 
1 60174 8.2%
 
2 14101 1.9%
 
3 2002 0.3%
 
4 390 0.1%
 
5 73 0.0%
 
6 20 0.0%
 
7 4 0.0%
 
8 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 659065 89.6%
 
1 60174 8.2%
 
2 14101 1.9%
 
3 2002 0.3%
 
4 390 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
4 390 0.1%
 
5 73 0.0%
 
6 20 0.0%
 
7 4 0.0%
 
8 2 0.0%
 

access_rev_acrtn
Numeric

Distinct count 6967
Unique (%) 0.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -2.5996
Minimum -355.34
Maximum 370
Zeros (%) 40.5%

Quantile statistics

Minimum -355.34
5-th percentile -13.54
Q1 -2.38
Median 0
Q3 0
95-th percentile 1.75
Maximum 370
Range 725.34
Interquartile range 2.38

Descriptive statistics

Standard deviation 8.3687
Coef of variation -3.2192
Kurtosis 85.287
Mean -2.5996
MAD 3.9463
Skewness -6.1196
Sum -1912900
Variance 70.034
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 297889 40.5%
 
-2.5 4261 0.6%
 
-1.67 3980 0.5%
 
-5.0 3771 0.5%
 
-10.0 3257 0.4%
 
-1.9 2967 0.4%
 
-2.0 2872 0.4%
 
-16.25 2601 0.4%
 
-1.82 2470 0.3%
 
-1.25 2388 0.3%
 
Other values (6957) 409375 55.6%
 

Minimum 5 values

Value Count Frequency (%)  
-355.34 1 0.0%
 
-355.0 1 0.0%
 
-335.0 1 0.0%
 
-297.88 1 0.0%
 
-287.14 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
180.48 1 0.0%
 
207.9 1 0.0%
 
234.0 1 0.0%
 
250.0 1 0.0%
 
370.0 1 0.0%
 

active_loan_cnt
Numeric

Distinct count 85
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4.8246
Minimum 0
Maximum 169
Zeros (%) 19.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 2
Q3 6
95-th percentile 20
Maximum 169
Range 169
Interquartile range 5

Descriptive statistics

Standard deviation 7.4198
Coef of variation 1.5379
Kurtosis 18.711
Mean 4.8246
MAD 4.8481
Skewness 3.3662
Sum 3550085
Variance 55.054
Memory size 5.6 MiB
Value Count Frequency (%)  
1 205731 28.0%
 
0 144369 19.6%
 
4 107743 14.6%
 
2 64295 8.7%
 
6 40033 5.4%
 
9 37378 5.1%
 
16 25782 3.5%
 
12 20921 2.8%
 
3 16370 2.2%
 
8 14943 2.0%
 
Other values (75) 58266 7.9%
 

Minimum 5 values

Value Count Frequency (%)  
0 144369 19.6%
 
1 205731 28.0%
 
2 64295 8.7%
 
3 16370 2.2%
 
4 107743 14.6%
 

Maximum 5 values

Value Count Frequency (%)  
140 3 0.0%
 
144 1 0.0%
 
155 1 0.0%
 
156 1 0.0%
 
169 1 0.0%
 

arpu_amt
Numeric

Distinct count 50262
Unique (%) 6.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 157.89
Minimum -2024.3
Maximum 3980.3
Zeros (%) 1.3%

Quantile statistics

Minimum -2024.3
5-th percentile 30.9
Q1 95.16
Median 137.02
Q3 211.55
95-th percentile 329.86
Maximum 3980.3
Range 6004.6
Interquartile range 116.39

Descriptive statistics

Standard deviation 97.453
Coef of variation 0.61724
Kurtosis 38.09
Mean 157.89
MAD 72.925
Skewness 2.3403
Sum 116180000
Variance 9497.1
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 9232 1.3%
 
15.0 326 0.0%
 
0.94 134 0.0%
 
1.02 125 0.0%
 
0.92 113 0.0%
 
1.0 99 0.0%
 
30.0 95 0.0%
 
0.99 84 0.0%
 
1.39 82 0.0%
 
1.52 81 0.0%
 
Other values (50252) 725460 98.6%
 

Minimum 5 values

Value Count Frequency (%)  
-2024.32 1 0.0%
 
-1857.8 1 0.0%
 
-1471.84 1 0.0%
 
-1057.14 1 0.0%
 
-970.51 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
3167.02 1 0.0%
 
3354.68 1 0.0%
 
3616.76 1 0.0%
 
3843.4 1 0.0%
 
3980.26 1 0.0%
 

billed_acct_size
Numeric

Distinct count 18
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.1674
Minimum 0
Maximum 18
Zeros (%) 2.2%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 1
Median 2
Q3 3
95-th percentile 5
Maximum 18
Range 18
Interquartile range 2

Descriptive statistics

Standard deviation 1.4825
Coef of variation 0.68399
Kurtosis 2.4547
Mean 2.1674
MAD 1.1408
Skewness 1.4228
Sum 1594826
Variance 2.1977
Memory size 5.6 MiB
Value Count Frequency (%)  
1 301599 41.0%
 
2 191423 26.0%
 
3 98508 13.4%
 
4 67698 9.2%
 
5 34413 4.7%
 
0 16421 2.2%
 
6 15365 2.1%
 
7 6256 0.9%
 
8 2382 0.3%
 
9 988 0.1%
 
Other values (8) 778 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 16421 2.2%
 
1 301599 41.0%
 
2 191423 26.0%
 
3 98508 13.4%
 
4 67698 9.2%
 

Maximum 5 values

Value Count Frequency (%)  
13 14 0.0%
 
14 5 0.0%
 
15 1 0.0%
 
16 2 0.0%
 
18 1 0.0%
 

birth_year
Numeric

Distinct count 93
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1974.9
Minimum 0
Maximum 2001
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1946
Q1 1964
Median 1979
Q3 1989
95-th percentile 1996
Maximum 2001
Range 2001
Interquartile range 25

Descriptive statistics

Standard deviation 42.302
Coef of variation 0.02142
Kurtosis 1856.3
Mean 1974.9
MAD 14.4
Skewness -39.855
Sum 1453193909
Variance 1789.5
Memory size 5.6 MiB
Value Count Frequency (%)  
1990 22610 3.1%
 
1989 22065 3.0%
 
1991 21937 3.0%
 
1992 21380 2.9%
 
1988 20984 2.9%
 
1993 20660 2.8%
 
1987 20277 2.8%
 
1986 19480 2.6%
 
1994 19362 2.6%
 
1985 19119 2.6%
 
Other values (83) 527957 71.7%
 

Minimum 5 values

Value Count Frequency (%)  
0 288 0.0%
 
1910 19 0.0%
 
1911 44 0.0%
 
1912 8 0.0%
 
1913 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1997 14687 2.0%
 
1998 12862 1.7%
 
1999 7592 1.0%
 
2000 31 0.0%
 
2001 1 0.0%
 

cust_status_ind
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.69112
1
508551
0
227280
Value Count Frequency (%)  
1 508551 69.1%
 
0 227280 30.9%
 

data_overage_amt
Numeric

Distinct count 3189
Unique (%) 0.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1831
Minimum -0.04
Maximum 2115
Zeros (%) 82.9%

Quantile statistics

Minimum -0.04
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 4.775
Maximum 2115
Range 2115
Interquartile range 0

Descriptive statistics

Standard deviation 10.743
Coef of variation 9.0805
Kurtosis 6769.3
Mean 1.1831
MAD 2.0175
Skewness 60.202
Sum 870580
Variance 115.42
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 610109 82.9%
 
0.48 2482 0.3%
 
0.71 2241 0.3%
 
0.6 2200 0.3%
 
0.62 2147 0.3%
 
0.58 2085 0.3%
 
0.75 2025 0.3%
 
0.65 2002 0.3%
 
1.5 2001 0.3%
 
5.0 1999 0.3%
 
Other values (3179) 106540 14.5%
 

Minimum 5 values

Value Count Frequency (%)  
-0.04 1 0.0%
 
-0.03 2 0.0%
 
0.0 610109 82.9%
 
0.01 361 0.0%
 
0.02 192 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1386.94 1 0.0%
 
1447.5 1 0.0%
 
1515.0 1 0.0%
 
1725.0 1 0.0%
 
2115.0 1 0.0%
 

data_ovrg_Yes_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.19522
Minimum 0
Maximum 6
Zeros (%) 82.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 6
Range 6
Interquartile range 0

Descriptive statistics

Standard deviation 0.4564
Coef of variation 2.3379
Kurtosis 6.7281
Mean 0.19522
MAD 0.32331
Skewness 2.4712
Sum 143651
Variance 0.2083
Memory size 5.6 MiB
Value Count Frequency (%)  
0 609310 82.8%
 
1 110917 15.1%
 
2 14234 1.9%
 
3 1234 0.2%
 
4 119 0.0%
 
5 14 0.0%
 
6 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 609310 82.8%
 
1 110917 15.1%
 
2 14234 1.9%
 
3 1234 0.2%
 
4 119 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2 14234 1.9%
 
3 1234 0.2%
 
4 119 0.0%
 
5 14 0.0%
 
6 3 0.0%
 

eqp_rev_acrtn
Numeric

Distinct count 12123
Unique (%) 1.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -2.6415
Minimum -3472.5
Maximum 2156.8
Zeros (%) 25.3%

Quantile statistics

Minimum -3472.5
5-th percentile -12.44
Q1 -1.75
Median -0.02
Q3 0
95-th percentile 1.67
Maximum 2156.8
Range 5629.3
Interquartile range 1.75

Descriptive statistics

Standard deviation 18.867
Coef of variation -7.1425
Kurtosis 5309.5
Mean -2.6415
MAD 4.5563
Skewness -21.009
Sum -1943700
Variance 355.96
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 186205 25.3%
 
-0.01 29063 3.9%
 
-0.02 6584 0.9%
 
-1.26 2808 0.4%
 
-0.63 2294 0.3%
 
-1.04 2282 0.3%
 
-0.03 2272 0.3%
 
-1.4 2160 0.3%
 
-0.04 2081 0.3%
 
-1.09 2074 0.3%
 
Other values (12113) 498008 67.7%
 

Minimum 5 values

Value Count Frequency (%)  
-3472.48 1 0.0%
 
-2907.0 1 0.0%
 
-2462.99 1 0.0%
 
-2362.99 1 0.0%
 
-2018.98 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1818.84 1 0.0%
 
1821.0 1 0.0%
 
1872.25 1 0.0%
 
1910.02 1 0.0%
 
2156.84 1 0.0%
 

frequency
Numeric

Distinct count 60
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 20.264
Minimum 1
Maximum 62
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 2
Q1 16
Median 23
Q3 27
95-th percentile 30
Maximum 62
Range 61
Interquartile range 11

Descriptive statistics

Standard deviation 8.7591
Coef of variation 0.43226
Kurtosis -0.37567
Mean 20.264
MAD 6.9862
Skewness -0.90594
Sum 14910574
Variance 76.721
Memory size 5.6 MiB
Value Count Frequency (%)  
21 54848 7.5%
 
20 53304 7.2%
 
22 46305 6.3%
 
23 44918 6.1%
 
25 44469 6.0%
 
24 43958 6.0%
 
26 43313 5.9%
 
29 43077 5.9%
 
27 41710 5.7%
 
28 41004 5.6%
 
Other values (50) 278925 37.9%
 

Minimum 5 values

Value Count Frequency (%)  
1 11282 1.5%
 
2 30460 4.1%
 
3 15660 2.1%
 
4 22610 3.1%
 
5 12828 1.7%
 

Maximum 5 values

Value Count Frequency (%)  
56 1 0.0%
 
57 2 0.0%
 
60 3 0.0%
 
61 1 0.0%
 
62 1 0.0%
 

gl_late_fees_amt
Numeric

Distinct count 15396
Unique (%) 2.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 25.172
Minimum -316.5
Maximum 590.06
Zeros (%) 34.4%

Quantile statistics

Minimum -316.5
5-th percentile 0
Q1 0
Median 10
Q3 38.41
95-th percentile 105
Maximum 590.06
Range 906.56
Interquartile range 38.41

Descriptive statistics

Standard deviation 34.976
Coef of variation 1.3895
Kurtosis 1.5819
Mean 25.172
MAD 27.492
Skewness 1.5413
Sum 18522000
Variance 1223.3
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 252874 34.4%
 
5.0 107046 14.5%
 
10.0 56521 7.7%
 
15.0 32470 4.4%
 
20.0 23335 3.2%
 
25.0 18360 2.5%
 
30.0 14931 2.0%
 
35.0 12903 1.8%
 
40.0 11424 1.6%
 
45.0 10405 1.4%
 
Other values (15386) 195562 26.6%
 

Minimum 5 values

Value Count Frequency (%)  
-316.5 1 0.0%
 
-241.93 1 0.0%
 
-193.56 1 0.0%
 
-169.54 1 0.0%
 
-169.33 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
297.15 1 0.0%
 
315.55 1 0.0%
 
326.55 1 0.0%
 
368.25 1 0.0%
 
590.06 1 0.0%
 

gl_nonrecur_rev_amt
Numeric

Distinct count 73737
Unique (%) 10.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 91.546
Minimum -2991.1
Maximum 7781.2
Zeros (%) 16.5%

Quantile statistics

Minimum -2991.1
5-th percentile 0
Q1 5
Median 30
Q3 113.18
95-th percentile 396.2
Maximum 7781.2
Range 10772
Interquartile range 108.18

Descriptive statistics

Standard deviation 182.93
Coef of variation 1.9982
Kurtosis 78.47
Mean 91.546
MAD 105.88
Skewness 5.4964
Sum 67362000
Variance 33464
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 121371 16.5%
 
5.0 43877 6.0%
 
10.0 22911 3.1%
 
15.0 19266 2.6%
 
20.0 11071 1.5%
 
30.0 10827 1.5%
 
25.0 9284 1.3%
 
35.0 7491 1.0%
 
1.99 7115 1.0%
 
45.0 6971 0.9%
 
Other values (73727) 475647 64.6%
 

Minimum 5 values

Value Count Frequency (%)  
-2991.08 1 0.0%
 
-2986.7 1 0.0%
 
-2384.39 1 0.0%
 
-2283.51 1 0.0%
 
-2227.27 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
6571.35 1 0.0%
 
6893.17 1 0.0%
 
6911.15 1 0.0%
 
7087.3 1 0.0%
 
7781.21 1 0.0%
 

gl_recur_rev_amt
Highly correlated

This variable is highly correlated with gl_tot_service_rev_amt and should be ignored for analysis

Correlation 0.99198

gl_tot_service_rev_amt
Highly correlated

This variable is highly correlated with monetary and should be ignored for analysis

Correlation 0.95604

handset_paychnl_cnt
Numeric

Distinct count 17
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.0173
Minimum 0
Maximum 19
Zeros (%) 55.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 19
Range 19
Interquartile range 2

Descriptive statistics

Standard deviation 1.4894
Coef of variation 1.4641
Kurtosis 3.3493
Mean 1.0173
MAD 1.1264
Skewness 1.7602
Sum 748552
Variance 2.2183
Memory size 5.6 MiB
Value Count Frequency (%)  
0 405184 55.1%
 
1 129606 17.6%
 
2 92156 12.5%
 
3 48565 6.6%
 
4 32266 4.4%
 
5 16172 2.2%
 
6 7157 1.0%
 
7 2915 0.4%
 
8 1054 0.1%
 
9 470 0.1%
 
Other values (7) 286 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 405184 55.1%
 
1 129606 17.6%
 
2 92156 12.5%
 
3 48565 6.6%
 
4 32266 4.4%
 

Maximum 5 values

Value Count Frequency (%)  
12 18 0.0%
 
13 10 0.0%
 
14 3 0.0%
 
16 1 0.0%
 
19 1 0.0%
 

index
Numeric

Distinct count 735831
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 379110
Minimum 0
Maximum 758195
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 37890
Q1 189600
Median 379130
Q3 568660
95-th percentile 720270
Maximum 758195
Range 758195
Interquartile range 379060

Descriptive statistics

Standard deviation 218870
Coef of variation 0.57734
Kurtosis -1.2
Mean 379110
MAD 189550
Skewness -0.00021479
Sum 278959902375
Variance 47906000000
Memory size 5.6 MiB
Value Count Frequency (%)  
1 2047 0.3%
 
1 753093 102.3%
 
1 683495 92.9%
 
1 681446 92.6%
 
1 687589 93.4%
 
1 685540 93.2%
 
1 675299 91.8%
 
1 673250 91.5%
 
1 679393 92.3%
 
1 677344 92.1%
 
Other values (735821) 278953703879 37910023.3%
 

Minimum 5 values

Value Count Frequency (%)  
1 2047 0.3%
 
1 753093 102.3%
 
1 683495 92.9%
 
1 681446 92.6%
 
1 687589 93.4%
 

Maximum 5 values

Value Count Frequency (%)  
1 217919 29.6%
 
1 219966 29.9%
 
1 213821 29.1%
 
1 215868 29.3%
 
1 0 0.0%
 

monetary
Numeric

Distinct count 467233
Unique (%) 63.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3346.8
Minimum -3238.4
Maximum 9999.5
Zeros (%) 1.3%

Quantile statistics

Minimum -3238.4
5-th percentile 141.06
Q1 1606.6
Median 3003.1
Q3 4743.2
95-th percentile 7932.9
Maximum 9999.5
Range 13238
Interquartile range 3136.6

Descriptive statistics

Standard deviation 2340.3
Coef of variation 0.69926
Kurtosis -0.1899
Mean 3346.8
MAD 1874.2
Skewness 0.64468
Sum 2462700000
Variance 5477100
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 9228 1.3%
 
15.0 309 0.0%
 
1.85 103 0.0%
 
30.0 100 0.0%
 
2.78 84 0.0%
 
1.82 77 0.0%
 
1.98 76 0.0%
 
2.07 60 0.0%
 
2.01 60 0.0%
 
1.91 56 0.0%
 
Other values (467223) 725678 98.6%
 

Minimum 5 values

Value Count Frequency (%)  
-3238.4 1 0.0%
 
-3237.72 1 0.0%
 
-3013.25 1 0.0%
 
-2934.22 1 0.0%
 
-2740.79 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
9999.19 1 0.0%
 
9999.25 1 0.0%
 
9999.32 1 0.0%
 
9999.36 1 0.0%
 
9999.54 1 0.0%
 

pplan_access_acrtn
Numeric

Distinct count 2675
Unique (%) 0.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 9.1913
Minimum -120
Maximum 1893.3
Zeros (%) 34.3%

Quantile statistics

Minimum -120
5-th percentile 0
Q1 0
Median 2.24
Q3 7.86
95-th percentile 35
Maximum 1893.3
Range 2013.3
Interquartile range 7.86

Descriptive statistics

Standard deviation 26.395
Coef of variation 2.8718
Kurtosis 225.12
Mean 9.1913
MAD 11.275
Skewness 10.125
Sum 6763200
Variance 696.72
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 252157 34.3%
 
2.5 10179 1.4%
 
5.0 9095 1.2%
 
2.41 7967 1.1%
 
2.24 7434 1.0%
 
2.32 7310 1.0%
 
2.6 7054 1.0%
 
1.67 7033 1.0%
 
2.17 6467 0.9%
 
10.0 6000 0.8%
 
Other values (2665) 415135 56.4%
 

Minimum 5 values

Value Count Frequency (%)  
-120.0 1 0.0%
 
-23.33 1 0.0%
 
-21.67 1 0.0%
 
-18.0 1 0.0%
 
-15.71 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1300.0 1 0.0%
 
1430.0 1 0.0%
 
1575.0 1 0.0%
 
1800.0 1 0.0%
 
1893.33 1 0.0%
 

recency
Numeric

Distinct count 31
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 202.33
Minimum 51
Maximum 962
Zeros (%) 0.0%

Quantile statistics

Minimum 51
5-th percentile 51
Q1 51
Median 51
Q3 324
95-th percentile 750
Maximum 962
Range 911
Interquartile range 273

Descriptive statistics

Standard deviation 253.72
Coef of variation 1.254
Kurtosis 0.40653
Mean 202.33
MAD 209.68
Skewness 1.3889
Sum 148881701
Variance 64375
Memory size 5.6 MiB
Value Count Frequency (%)  
51 496826 67.5%
 
597 14640 2.0%
 
628 14170 1.9%
 
658 13349 1.8%
 
566 11694 1.6%
 
689 11203 1.5%
 
719 10243 1.4%
 
538 10208 1.4%
 
750 9815 1.3%
 
507 8625 1.2%
 
Other values (21) 135058 18.4%
 

Minimum 5 values

Value Count Frequency (%)  
51 496826 67.5%
 
81 6833 0.9%
 
112 6034 0.8%
 
142 6310 0.9%
 
173 6621 0.9%
 

Maximum 5 values

Value Count Frequency (%)  
842 6556 0.9%
 
872 5818 0.8%
 
903 3586 0.5%
 
931 2743 0.4%
 
962 236 0.0%
 

recurring_paychnl_cnt
Numeric

Distinct count 18
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.3216
Minimum 0
Maximum 19
Zeros (%) 41.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 2
95-th percentile 5
Maximum 19
Range 19
Interquartile range 2

Descriptive statistics

Standard deviation 1.595
Coef of variation 1.2069
Kurtosis 2.6133
Mean 1.3216
MAD 1.2486
Skewness 1.5122
Sum 972478
Variance 2.5441
Memory size 5.6 MiB
Value Count Frequency (%)  
0 304777 41.4%
 
1 175943 23.9%
 
2 116010 15.8%
 
3 59393 8.1%
 
4 41367 5.6%
 
5 21330 2.9%
 
6 9988 1.4%
 
7 4063 0.6%
 
8 1657 0.2%
 
9 720 0.1%
 
Other values (8) 583 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 304777 41.4%
 
1 175943 23.9%
 
2 116010 15.8%
 
3 59393 8.1%
 
4 41367 5.6%
 

Maximum 5 values

Value Count Frequency (%)  
13 9 0.0%
 
14 3 0.0%
 
15 1 0.0%
 
17 1 0.0%
 
19 2 0.0%
 

tot_revenue_acrtn
Numeric

Distinct count 26333
Unique (%) 3.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -13.353
Minimum -3843.4
Maximum 3227
Zeros (%) 2.2%

Quantile statistics

Minimum -3843.4
5-th percentile -58.74
Q1 -12.46
Median -4.48
Q3 -1.18
95-th percentile 1.86
Maximum 3227
Range 7070.4
Interquartile range 11.28

Descriptive statistics

Standard deviation 39.046
Coef of variation -2.9241
Kurtosis 622.65
Mean -13.353
MAD 16.417
Skewness -3.811
Sum -9825700
Variance 1524.6
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 16268 2.2%
 
0.01 4940 0.7%
 
0.02 1470 0.2%
 
-1.57 1282 0.2%
 
-1.64 1197 0.2%
 
-1.45 1186 0.2%
 
-1.5 1183 0.2%
 
-1.38 1121 0.2%
 
-1.65 1107 0.2%
 
-1.32 1016 0.1%
 
Other values (26323) 705061 95.8%
 

Minimum 5 values

Value Count Frequency (%)  
-3843.4 1 0.0%
 
-3165.34 1 0.0%
 
-2890.66 1 0.0%
 
-2617.37 1 0.0%
 
-2296.67 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2068.56 1 0.0%
 
2116.24 1 0.0%
 
2331.09 1 0.0%
 
2621.98 1 0.0%
 
3227.02 1 0.0%
 

usage_rev_acrtn
Numeric

Distinct count 3874
Unique (%) 0.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.38369
Minimum -1315.4
Maximum 2719.4
Zeros (%) 90.0%

Quantile statistics

Minimum -1315.4
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.34
Maximum 2719.4
Range 4034.8
Interquartile range 0

Descriptive statistics

Standard deviation 12.18
Coef of variation 31.744
Kurtosis 11184
Mean 0.38369
MAD 0.83164
Skewness 79.753
Sum 282330
Variance 148.34
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 662526 90.0%
 
0.09 2354 0.3%
 
0.07 2325 0.3%
 
0.08 1919 0.3%
 
0.1 1433 0.2%
 
0.5 1168 0.2%
 
0.48 1043 0.1%
 
0.01 899 0.1%
 
0.4 802 0.1%
 
0.33 802 0.1%
 
Other values (3864) 60560 8.2%
 

Minimum 5 values

Value Count Frequency (%)  
-1315.37 1 0.0%
 
-1040.2 1 0.0%
 
-377.09 1 0.0%
 
-345.0 1 0.0%
 
-277.5 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1728.32 1 0.0%
 
2031.54 1 0.0%
 
2115.0 1 0.0%
 
2130.18 1 0.0%
 
2719.42 1 0.0%
 

voice_overage_amt
Numeric

Distinct count 3432
Unique (%) 0.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.37214
Minimum 0
Maximum 1376.9
Zeros (%) 91.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.48
Maximum 1376.9
Range 1376.9
Interquartile range 0

Descriptive statistics

Standard deviation 7.1045
Coef of variation 19.091
Kurtosis 9373.4
Mean 0.37214
MAD 0.69345
Skewness 80.308
Sum 273830
Variance 50.473
Memory size 5.6 MiB
Value Count Frequency (%)  
0.0 672735 91.4%
 
0.07 1944 0.3%
 
0.06 1763 0.2%
 
0.09 1415 0.2%
 
0.12 1240 0.2%
 
0.14 1083 0.1%
 
0.08 1028 0.1%
 
0.13 817 0.1%
 
0.16 751 0.1%
 
0.1 716 0.1%
 
Other values (3422) 52339 7.1%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 672735 91.4%
 
0.01 40 0.0%
 
0.02 46 0.0%
 
0.03 316 0.0%
 
0.04 695 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
1079.37 1 0.0%
 
1100.32 1 0.0%
 
1110.7 1 0.0%
 
1149.16 1 0.0%
 
1376.9 1 0.0%
 

zip5_cd
Numeric

Distinct count 6513
Unique (%) 0.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 70121
Minimum 907
Maximum 99999
Zeros (%) 0.0%

Quantile statistics

Minimum 907
5-th percentile 10950
Q1 75117
Median 79766
Q3 92841
95-th percentile 95608
Maximum 99999
Range 99092
Interquartile range 17724

Descriptive statistics

Standard deviation 32310
Coef of variation 0.46078
Kurtosis -0.48877
Mean 70121
MAD 26505
Skewness -1.1407
Sum 51597074521
Variance 1044000000
Memory size 5.6 MiB
Value Count Frequency (%)  
79936 1251 0.2%
 
77494 1194 0.2%
 
79938 1189 0.2%
 
92345 1071 0.1%
 
79912 1049 0.1%
 
92336 1000 0.1%
 
95076 992 0.1%
 
77573 927 0.1%
 
77449 918 0.1%
 
77433 901 0.1%
 
Other values (6503) 725339 98.6%
 

Minimum 5 values

Value Count Frequency (%)  
907 1 0.0%
 
1930 1 0.0%
 
6390 2 0.0%
 
7002 1 0.0%
 
7017 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
98498 1 0.0%
 
98832 1 0.0%
 
98840 1 0.0%
 
99505 1 0.0%
 
99999 1 0.0%
 

Correlations

Sample

cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size VoiceTextData_pplangrp_cnt VoiceData_revtype_cnt active_loan_cnt birth_year zip5_cd Loand_equip_cnt BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_tot_service_rev_amt gl_recur_rev_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt
0 1 51 25 8012.49 5 0 5 5 5 15 1966 12309 3 0 5 3 5 0 0 0 3 5 0 0 1 0 0 2 0 0 1 5 0 2 0 0 0 0 2 0 320.50 5.00 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 1 51 21 5977.00 0 0 3 3 3 9 1993 76104 3 0 3 3 3 0 0 0 3 2 0 0 0 3 0 0 0 3 0 0 3 0 1 0 0 0 0 0 284.62 96.97 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 1 51 22 5842.94 0 0 4 4 4 16 1971 12601 4 0 4 0 4 0 0 0 4 4 4 0 0 0 0 2 0 0 0 0 4 0 4 0 4 0 0 0 265.59 5.40 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 1 51 21 6018.78 0 0 2 2 2 4 1984 95337 2 2 0 2 0 2 0 0 2 0 0 0 0 0 2 0 0 2 0 2 0 0 2 0 0 0 0 0 286.61 0.00 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 1 51 26 7135.08 4 0 4 4 4 12 1960 14883 4 0 4 3 4 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 1 274.43 15.00 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0

Correlation Check

In [22]:
f, ax = plt.subplots(figsize=(20, 12))
corr = df.corr()
hm = sns.heatmap(round(corr,2), annot=True, ax=ax, cmap="coolwarm",fmt='.2f',
            linewidths=.05)
f.subplots_adjust(top=0.93)
t= f.suptitle('Correlations between Features\n', fontsize=10)
In [23]:
corr_matrix = df.corr().abs()

#the matrix is symmetric so we need to extract upper triangle matrix without diagonal (k = 1)
sol = (corr_matrix.where(np.triu(np.ones(corr_matrix.shape), k=1).astype(np.bool))
                 .stack()
                 .sort_values(ascending=False))
#first element of sol series is the pair with the bigest correlation
In [24]:
sol
Out[24]:
gl_tot_service_rev_amt      gl_recur_rev_amt              0.991984
monetary                    gl_tot_service_rev_amt        0.956043
billed_acct_size            VoiceTextData_pplangrp_cnt    0.948930
monetary                    gl_recur_rev_amt              0.948818
VoiceTextData_pplangrp_cnt  VoiceData_revtype_cnt         0.926589
billed_acct_size            VoiceData_revtype_cnt         0.925498
Loand_equip_cnt             Edge_contract_cnt             0.918416
recency                     frequency                     0.914661
SingleLine_plan_cnt         VoiceOnly_pplangrp_cnt        0.884635
cust_status_ind             recency                       0.879735
active_loan_cnt             Edge_contract_cnt             0.878532
                            Loand_equip_cnt               0.874786
billed_acct_size            active_loan_cnt               0.850371
VoiceTextData_pplangrp_cnt  active_loan_cnt               0.833709
VoiceData_revtype_cnt       active_loan_cnt               0.831770
cust_status_ind             frequency                     0.826603
VoiceData_revtype_cnt       Loand_equip_cnt               0.783133
VoiceTextData_pplangrp_cnt  Loand_equip_cnt               0.779297
billed_acct_size            Loand_equip_cnt               0.768819
                            Edge_contract_cnt             0.761489
VoiceData_revtype_cnt       Edge_contract_cnt             0.761278
VoiceTextData_pplangrp_cnt  Edge_contract_cnt             0.746970
monetary                    arpu_amt                      0.745987
billed_acct_size            arpu_amt                      0.733814
Edge_contract_cnt           arpu_amt                      0.732657
VoiceData_revtype_cnt       arpu_amt                      0.726980
Loand_equip_cnt             arpu_amt                      0.726792
VoiceTextData_pplangrp_cnt  arpu_amt                      0.721092
VoiceData_revtype_cnt       MTM_contract_cnt              0.709279
tot_revenue_acrtn           eqp_rev_acrtn                 0.708468
                                                            ...   
Purch_equip_cnt             voice_overage_amt             0.001793
SingleLine_plan_cnt         usage_rev_acrtn               0.001793
data_ovrg_Yes_cnt           FamTeens_cnt                  0.001732
VoiceOnly_pplangrp_cnt      voice_overage_amt             0.001714
BeyondUnlmtd_plan_cnt       usage_rev_acrtn               0.001648
ATT_port_cnt                voice_overage_amt             0.001633
DataOnly_Revtype_cnt        usage_rev_acrtn               0.001525
VoiceOnly_pplangrp_cnt      usage_rev_acrtn               0.001474
Hispanic_ethnicity_cnt      usage_rev_acrtn               0.001474
zip5_cd                     usage_rev_acrtn               0.001443
TX_state_cd                 usage_rev_acrtn               0.001388
DataOnly_Revtype_cnt        voice_overage_amt             0.001367
frequency                   T_Mobile_port_cnt             0.001243
MatureAdults_cnt            access_rev_acrtn              0.001203
Edge_contract_cnt           BasicDev_cnt                  0.001193
MTM_contract_cnt            usage_rev_acrtn               0.001051
Sprint_port_cnt             eqp_rev_acrtn                 0.000977
VoiceOnly_pplangrp_cnt      LocalRetailer_chnl_cnt        0.000932
birth_year                  pplan_access_acrtn            0.000877
Sprint_port_cnt             voice_overage_amt             0.000840
FamTeens_cnt                voice_overage_amt             0.000781
SingleLine_plan_cnt         voice_overage_amt             0.000734
birth_year                  CA_state_cd                   0.000668
handset_paychnl_cnt         eqp_rev_acrtn                 0.000628
DataOnly_Revtype_cnt        BasicDev_cnt                  0.000568
birth_year                  LocalRetailer_chnl_cnt        0.000383
TwoYear_contract_cnt        access_rev_acrtn              0.000367
birth_year                  usage_rev_acrtn               0.000303
gl_late_fees_amt            data_overage_amt              0.000235
TwoYear_contract_cnt        tot_revenue_acrtn             0.000228
Length: 1326, dtype: float64

We see that these are the highly correlated features, so we have to remove them:

gl_tot_service_rev_amt      gl_recur_rev_amt              0.991984
monetary                    gl_tot_service_rev_amt        0.956043
billed_acct_size            VoiceTextData_pplangrp_cnt    0.948930
monetary                    gl_recur_rev_amt              0.948818
VoiceTextData_pplangrp_cnt  VoiceData_revtype_cnt         0.926589
billed_acct_size            VoiceData_revtype_cnt         0.925498
Loand_equip_cnt             Edge_contract_cnt             0.918416

Let us remove:

gl_tot_service_rev_amt
VoiceTextData_pplangrp_cnt
gl_recur_rev_amt
VoiceData_revtype_cnt
Loand_equip_cnt
In [25]:
df = df.drop(['gl_tot_service_rev_amt',
    'VoiceTextData_pplangrp_cnt',
    'gl_recur_rev_amt',
    'VoiceData_revtype_cnt',
    'Loand_equip_cnt'], axis =1)
In [26]:
df.head()
Out[26]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year zip5_cd ... arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt
0 1 51 25 8012.49 5 0 5 15 1966 12309 ... 320.50 5.00 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 1 51 21 5977.00 0 0 3 9 1993 76104 ... 284.62 96.97 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 1 51 22 5842.94 0 0 4 16 1971 12601 ... 265.59 5.40 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 1 51 21 6018.78 0 0 2 4 1984 95337 ... 286.61 0.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 1 51 26 7135.08 4 0 4 12 1960 14883 ... 274.43 15.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0

5 rows × 47 columns

Dropping Zip codes, as machine treats zip code with higher value as greater class:

In [27]:
df.zip5_cd.head()
Out[27]:
0    12309
1    76104
2    12601
3    95337
4    14883
Name: zip5_cd, dtype: int64
In [28]:
df = df.drop('zip5_cd', axis = 1)

K Means Clustering:

In [29]:
X3 = df.iloc[: , :].values
In [30]:
X3
Out[30]:
array([[ 1., 51., 25., ...,  0.,  0.,  0.],
       [ 1., 51., 21., ...,  0.,  0.,  0.],
       [ 1., 51., 22., ...,  0.,  0.,  0.],
       ...,
       [ 1., 51., 30., ...,  0.,  0.,  0.],
       [ 1., 51., 22., ...,  0.,  0.,  0.],
       [ 1., 51., 24., ...,  0.,  0.,  0.]])

Determining the Number of clusters using Elbow Curve:

In [31]:
# k means determine k
from sklearn.cluster import KMeans
from sklearn import metrics
from scipy.spatial.distance import cdist

distortions = []
K = range(1,10)
for k in K:
    kmeanModel = KMeans(n_clusters=k).fit(X3)
    kmeanModel.fit(X3)
    distortions.append(sum(np.min(cdist(X3, kmeanModel.cluster_centers_, 'euclidean'), axis=1)) / X3.shape[0])

# Plot the elbow
plt.plot(K, distortions, 'bx-')
plt.xlabel('k')
plt.ylabel('Distortion')
plt.title('The Elbow Method showing the optimal k')
plt.show()

The Elbow appears somewhere between 3 and 4. We shall choose 4 clusters to improve granularity and better segmentation.

In [32]:
from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=4, init='k-means++', random_state=0)
clusters = kmeans.fit_predict(X3)
In [33]:
df['Customer_Cluster'] = clusters

Checking the Attributes of Each Cluster:

In [34]:
df['Customer_Cluster'].value_counts()
Out[34]:
0    287071
2    196863
3    160725
1     91172
Name: Customer_Cluster, dtype: int64
In [35]:
sns.countplot('Customer_Cluster',data=df)
Out[35]:
<matplotlib.axes._subplots.AxesSubplot at 0x1dc0268b320>
In [36]:
fig = plt.figure(figsize=(8,6))
dx = fig.add_subplot(111, projection='3d')
colors = ['silver','midnightblue', 'maroon', 'gold']

for i in range(0,4):
    dx.scatter(df[df.Customer_Cluster == i].recency, 
               df[df.Customer_Cluster == i].frequency, 
               df[df.Customer_Cluster == i].monetary, 
               c = colors[i], 
               label = 'Cluster ' + str(i+1), 
               s=50)

dx.set_title('Clusters of clients')
dx.set_xlabel('Recency')
dx.set_ylabel('Frequency')
dx.set_zlabel('Monetary')
dx.legend()
Out[36]:
<matplotlib.legend.Legend at 0x1dc024cd780>

It is difficult to visualize the clusters in 3 dimensions using just Recency, Frequency and Monetary values. Let us look at the Attributes of customers falling into each of these clusters:

Visualizing the Clusters in 3 Dimensions:

In [37]:
import plotly.graph_objs as go
import plotly as py

warnings.filterwarnings("ignore")
style.use('classic')
py.offline.init_notebook_mode(connected = True)
df = df.sample(frac=0.1, random_state=1)

trace1 = go.Scatter3d(
    x= df['recency'],
    y= df['frequency'],
    z= df['monetary'],
    mode='markers',
     marker=dict(
        color = df['Customer_Cluster'], 
        size= 20,
        line=dict(
            color= df['Customer_Cluster'],
            width= 12
        ),
        opacity=0.8
     )
)
data = [trace1]
layout = go.Layout(
#     margin=dict(
#         l=0,
#         r=0,
#         b=0,
#         t=0
#     )
    title= 'Customer_Cluster',
    scene = dict(
            xaxis = dict(title  = 'Recency'),
            yaxis = dict(title  = 'Frequency'),
            zaxis = dict(title  = 'Monetary Value')
        )
)
fig = go.Figure(data=data, layout=layout)
py.offline.iplot(fig)

Cluster 0:

In [38]:
pd.set_option('display.max_columns', 500)
In [39]:
df[df['Customer_Cluster']== 0].head(10)
Out[39]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
56841 1 51 27 2164.37 1 0 1 1 1952 0 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 0 80.16 5.0 6.99 1.30 -1.61 -0.29 0.00 0.00 0.00 0.00 0
237065 1 51 27 1828.20 2 0 1 2 1932 0 0 0 0 0 0 0 2 2 2 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 67.71 0.0 0.00 1.48 -3.27 0.00 -0.78 0.00 0.00 0.00 0
62699 1 51 24 2805.00 1 1 1 1 1991 0 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0 0 0 116.88 5.0 -46.28 1.67 -0.10 -1.04 2.00 0.00 1.25 0.00 0
486254 1 51 29 3088.69 1 0 1 1 1962 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 106.51 70.0 370.00 0.00 0.76 -1.21 -0.10 0.00 10.34 0.00 0
104869 1 51 20 3415.30 4 0 4 4 1983 0 0 4 4 0 0 1 1 4 0 0 0 0 4 2 0 0 0 4 0 3 4 0 0 0 0 0 170.76 0.0 62.40 14.00 -1.30 1.14 8.20 0.75 1.50 0.81 0
586227 0 201 21 1861.50 3 0 3 0 1973 0 0 0 0 3 3 0 0 0 2 0 0 0 0 1 0 0 0 3 0 1 2 0 0 3 1 0 88.64 15.0 15.00 5.00 -9.95 -1.57 0.00 0.00 0.00 0.00 0
43505 0 477 13 2863.65 0 2 2 4 1976 0 0 0 2 0 0 0 2 0 2 0 0 0 0 0 0 2 0 2 2 0 2 0 2 2 0 0 220.28 0.0 0.00 16.92 -5.58 0.00 -0.02 0.00 0.00 0.00 0
427692 1 51 28 2792.51 0 1 1 1 1987 0 0 1 1 0 0 0 1 1 0 0 0 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 99.73 0.0 -248.63 2.32 1.20 0.00 0.83 0.00 0.00 0.00 0
561895 1 51 22 2789.65 0 0 1 1 1973 1 0 1 1 0 0 0 1 1 1 0 0 1 0 0 0 1 0 1 0 0 1 1 0 0 0 0 126.80 20.0 64.75 0.00 2.04 0.00 2.75 0.00 0.00 2.03 0
526904 1 51 22 3572.31 0 0 2 2 1981 2 1 2 2 2 0 0 1 0 0 0 0 0 2 0 0 0 0 2 0 1 2 0 0 2 0 0 162.38 0.0 1.99 0.00 -6.95 0.00 0.85 0.00 0.00 0.00 0

Cluster 0:

In [40]:
df.head()
Out[40]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
56841 1 51 27 2164.37 1 0 1 1 1952 0 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 0 80.16 5.0 6.99 1.30 -1.61 -0.29 0.00 0.0 0.00 0.0 0
13719 1 51 28 6943.23 0 4 4 8 1975 0 0 3 0 0 0 0 2 0 2 0 0 4 0 0 0 0 0 4 4 2 4 0 0 0 0 0 247.97 35.0 35.00 15.71 -0.92 0.00 1.49 0.0 0.00 0.0 1
237065 1 51 27 1828.20 2 0 1 2 1932 0 0 0 0 0 0 0 2 2 2 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 67.71 0.0 0.00 1.48 -3.27 0.00 -0.78 0.0 0.00 0.0 0
62699 1 51 24 2805.00 1 1 1 1 1991 0 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0 0 0 116.88 5.0 -46.28 1.67 -0.10 -1.04 2.00 0.0 1.25 0.0 0
283582 1 51 21 4451.09 0 0 5 15 1984 4 2 3 5 0 0 0 3 4 0 2 0 5 0 0 0 2 0 4 0 1 1 0 0 0 0 0 211.96 0.0 0.00 0.71 -15.14 9.29 -1.10 0.0 0.00 0.0 3
In [41]:
import pandas_profiling
profile1 = pandas_profiling.ProfileReport(df[df['Customer_Cluster']== 0])
profile1
Out[41]:

Overview

Dataset info

Number of variables 48
Number of observations 28578
Total Missing (%) 0.0%
Total size in memory 10.4 MiB
Average record size in memory 380.0 B

Variables types

Numeric 46
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 1
Unsupported 0

Warnings

Variables

ATT_port_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.26359
Minimum 0
Maximum 7
Zeros (%) 79.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.58797
Coef of variation 2.2306
Kurtosis 9.8
Mean 0.26359
MAD 0.41966
Skewness 2.7366
Sum 7533
Variance 0.34571
Memory size 223.3 KiB
Value Count Frequency (%)  
0 22749 79.6%
 
1 4464 15.6%
 
2 1123 3.9%
 
3 166 0.6%
 
4 59 0.2%
 
5 14 0.0%
 
6 2 0.0%
 
7 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 22749 79.6%
 
1 4464 15.6%
 
2 1123 3.9%
 
3 166 0.6%
 
4 59 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
3 166 0.6%
 
4 59 0.2%
 
5 14 0.0%
 
6 2 0.0%
 
7 1 0.0%
 

AndroidDev_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.55021
Minimum 0
Maximum 11
Zeros (%) 57.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 0.75766
Coef of variation 1.377
Kurtosis 5.6502
Mean 0.55021
MAD 0.63273
Skewness 1.7372
Sum 15724
Variance 0.57404
Memory size 223.3 KiB
Value Count Frequency (%)  
0 16432 57.5%
 
1 9328 32.6%
 
2 2281 8.0%
 
3 372 1.3%
 
4 129 0.5%
 
5 25 0.1%
 
6 7 0.0%
 
11 1 0.0%
 
9 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 16432 57.5%
 
1 9328 32.6%
 
2 2281 8.0%
 
3 372 1.3%
 
4 129 0.5%
 

Maximum 5 values

Value Count Frequency (%)  
6 7 0.0%
 
7 1 0.0%
 
8 1 0.0%
 
9 1 0.0%
 
11 1 0.0%
 

BasicDev_cnt
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.041745
Minimum 0
Maximum 5
Zeros (%) 96.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 5
Range 5
Interquartile range 0

Descriptive statistics

Standard deviation 0.23463
Coef of variation 5.6205
Kurtosis 56.223
Mean 0.041745
MAD 0.080555
Skewness 6.7243
Sum 1193
Variance 0.055051
Memory size 223.3 KiB
Value Count Frequency (%)  
0 27573 96.5%
 
1 839 2.9%
 
2 148 0.5%
 
3 15 0.1%
 
4 2 0.0%
 
5 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 27573 96.5%
 
1 839 2.9%
 
2 148 0.5%
 
3 15 0.1%
 
4 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 839 2.9%
 
2 148 0.5%
 
3 15 0.1%
 
4 2 0.0%
 
5 1 0.0%
 

BeyondUnlmtd_plan_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.23095
Minimum 0
Maximum 10
Zeros (%) 86.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.68425
Coef of variation 2.9628
Kurtosis 22.628
Mean 0.23095
MAD 0.39711
Skewness 4.0998
Sum 6600
Variance 0.4682
Memory size 223.3 KiB
Value Count Frequency (%)  
0 24570 86.0%
 
1 2346 8.2%
 
2 1106 3.9%
 
3 335 1.2%
 
4 131 0.5%
 
5 54 0.2%
 
6 18 0.1%
 
7 12 0.0%
 
8 4 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 24570 86.0%
 
1 2346 8.2%
 
2 1106 3.9%
 
3 335 1.2%
 
4 131 0.5%
 

Maximum 5 values

Value Count Frequency (%)  
6 18 0.1%
 
7 12 0.0%
 
8 4 0.0%
 
9 1 0.0%
 
10 1 0.0%
 

CA_state_cd
Numeric

Distinct count 13
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.84369
Minimum 0
Maximum 12
Zeros (%) 50.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 12
Range 12
Interquartile range 1

Descriptive statistics

Standard deviation 1.1184
Coef of variation 1.3256
Kurtosis 5.4008
Mean 0.84369
MAD 0.85467
Skewness 1.8389
Sum 24111
Variance 1.2508
Memory size 223.3 KiB
Value Count Frequency (%)  
0 14475 50.7%
 
1 7666 26.8%
 
2 4254 14.9%
 
3 1375 4.8%
 
4 474 1.7%
 
5 198 0.7%
 
6 73 0.3%
 
7 38 0.1%
 
8 13 0.0%
 
9 6 0.0%
 
Other values (3) 6 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14475 50.7%
 
1 7666 26.8%
 
2 4254 14.9%
 
3 1375 4.8%
 
4 474 1.7%
 

Maximum 5 values

Value Count Frequency (%)  
8 13 0.0%
 
9 6 0.0%
 
10 3 0.0%
 
11 2 0.0%
 
12 1 0.0%
 

CommunicStore_chnl_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.70806
Minimum 0
Maximum 16
Zeros (%) 54.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 16
Range 16
Interquartile range 1

Descriptive statistics

Standard deviation 0.98221
Coef of variation 1.3872
Kurtosis 7.845
Mean 0.70806
MAD 0.76703
Skewness 2.0234
Sum 20235
Variance 0.96474
Memory size 223.3 KiB
Value Count Frequency (%)  
0 15479 54.2%
 
1 8231 28.8%
 
2 3416 12.0%
 
3 958 3.4%
 
4 317 1.1%
 
5 98 0.3%
 
6 44 0.2%
 
7 20 0.1%
 
8 8 0.0%
 
9 5 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15479 54.2%
 
1 8231 28.8%
 
2 3416 12.0%
 
3 958 3.4%
 
4 317 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 20 0.1%
 
8 8 0.0%
 
9 5 0.0%
 
11 1 0.0%
 
16 1 0.0%
 

Customer_Cluster
Constant

This variable is constant and should be ignored for analysis

Constant value 0

DataOnly_Revtype_cnt
Numeric

Distinct count 9
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.2721
Minimum 0
Maximum 10
Zeros (%) 77.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.56937
Coef of variation 2.0925
Kurtosis 13.043
Mean 0.2721
MAD 0.42196
Skewness 2.7696
Sum 7776
Variance 0.32418
Memory size 223.3 KiB
Value Count Frequency (%)  
0 22159 77.5%
 
1 5361 18.8%
 
2 849 3.0%
 
3 151 0.5%
 
4 38 0.1%
 
5 14 0.0%
 
6 3 0.0%
 
7 2 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 22159 77.5%
 
1 5361 18.8%
 
2 849 3.0%
 
3 151 0.5%
 
4 38 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
4 38 0.1%
 
5 14 0.0%
 
6 3 0.0%
 
7 2 0.0%
 
10 1 0.0%
 

Edge_contract_cnt
Numeric

Distinct count 10
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1552
Minimum 0
Maximum 9
Zeros (%) 13.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 1
95-th percentile 2
Maximum 9
Range 9
Interquartile range 0

Descriptive statistics

Standard deviation 0.8037
Coef of variation 0.69573
Kurtosis 7.0283
Mean 1.1552
MAD 0.52068
Skewness 1.7732
Sum 33013
Variance 0.64593
Memory size 223.3 KiB
Value Count Frequency (%)  
1 18405 64.4%
 
2 4826 16.9%
 
0 3968 13.9%
 
3 829 2.9%
 
4 351 1.2%
 
5 151 0.5%
 
6 34 0.1%
 
7 8 0.0%
 
8 4 0.0%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 3968 13.9%
 
1 18405 64.4%
 
2 4826 16.9%
 
3 829 2.9%
 
4 351 1.2%
 

Maximum 5 values

Value Count Frequency (%)  
5 151 0.5%
 
6 34 0.1%
 
7 8 0.0%
 
8 4 0.0%
 
9 2 0.0%
 

FamTeens_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.44132
Minimum 0
Maximum 17
Zeros (%) 71.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 17
Range 17
Interquartile range 1

Descriptive statistics

Standard deviation 0.85118
Coef of variation 1.9287
Kurtosis 13.791
Mean 0.44132
MAD 0.63169
Skewness 2.7641
Sum 12612
Variance 0.7245
Memory size 223.3 KiB
Value Count Frequency (%)  
0 20453 71.6%
 
1 5074 17.8%
 
2 2139 7.5%
 
3 584 2.0%
 
4 212 0.7%
 
5 72 0.3%
 
6 27 0.1%
 
7 11 0.0%
 
8 3 0.0%
 
10 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 20453 71.6%
 
1 5074 17.8%
 
2 2139 7.5%
 
3 584 2.0%
 
4 212 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
6 27 0.1%
 
7 11 0.0%
 
8 3 0.0%
 
10 2 0.0%
 
17 1 0.0%
 

GoUnlimit_plan_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.55263
Minimum 0
Maximum 12
Zeros (%) 67.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 12
Range 12
Interquartile range 1

Descriptive statistics

Standard deviation 0.97929
Coef of variation 1.7721
Kurtosis 6.7127
Mean 0.55263
MAD 0.74836
Skewness 2.2602
Sum 15793
Variance 0.959
Memory size 223.3 KiB
Value Count Frequency (%)  
0 19350 67.7%
 
1 4943 17.3%
 
2 2858 10.0%
 
3 877 3.1%
 
4 352 1.2%
 
5 131 0.5%
 
6 43 0.2%
 
7 16 0.1%
 
8 5 0.0%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 19350 67.7%
 
1 4943 17.3%
 
2 2858 10.0%
 
3 877 3.1%
 
4 352 1.2%
 

Maximum 5 values

Value Count Frequency (%)  
6 43 0.2%
 
7 16 0.1%
 
8 5 0.0%
 
9 2 0.0%
 
12 1 0.0%
 

Hispanic_ethnicity_cnt
Numeric

Distinct count 10
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.42403
Minimum 0
Maximum 9
Zeros (%) 75.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 9
Range 9
Interquartile range 0

Descriptive statistics

Standard deviation 0.91764
Coef of variation 2.1641
Kurtosis 10.627
Mean 0.42403
MAD 0.64212
Skewness 2.8863
Sum 12118
Variance 0.84206
Memory size 223.3 KiB
Value Count Frequency (%)  
0 21638 75.7%
 
1 3764 13.2%
 
2 2012 7.0%
 
3 663 2.3%
 
4 287 1.0%
 
5 136 0.5%
 
6 47 0.2%
 
7 22 0.1%
 
9 5 0.0%
 
8 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 21638 75.7%
 
1 3764 13.2%
 
2 2012 7.0%
 
3 663 2.3%
 
4 287 1.0%
 

Maximum 5 values

Value Count Frequency (%)  
5 136 0.5%
 
6 47 0.2%
 
7 22 0.1%
 
8 4 0.0%
 
9 5 0.0%
 

IphoneDev_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.89166
Minimum 0
Maximum 11
Zeros (%) 32.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 1
95-th percentile 2
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 0.83034
Coef of variation 0.93122
Kurtosis 6.0522
Mean 0.89166
MAD 0.57329
Skewness 1.5498
Sum 25482
Variance 0.68946
Memory size 223.3 KiB
Value Count Frequency (%)  
1 14857 52.0%
 
0 9187 32.1%
 
2 3516 12.3%
 
3 656 2.3%
 
4 248 0.9%
 
5 76 0.3%
 
6 25 0.1%
 
8 6 0.0%
 
7 5 0.0%
 
11 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 9187 32.1%
 
1 14857 52.0%
 
2 3516 12.3%
 
3 656 2.3%
 
4 248 0.9%
 

Maximum 5 values

Value Count Frequency (%)  
6 25 0.1%
 
7 5 0.0%
 
8 6 0.0%
 
9 1 0.0%
 
11 1 0.0%
 

LocalRetailer_chnl_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.28711
Minimum 0
Maximum 12
Zeros (%) 81.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 12
Range 12
Interquartile range 0

Descriptive statistics

Standard deviation 0.72559
Coef of variation 2.5272
Kurtosis 19.533
Mean 0.28711
MAD 0.46893
Skewness 3.6118
Sum 8205
Variance 0.52648
Memory size 223.3 KiB
Value Count Frequency (%)  
0 23338 81.7%
 
1 3232 11.3%
 
2 1425 5.0%
 
3 366 1.3%
 
4 138 0.5%
 
5 38 0.1%
 
6 20 0.1%
 
7 12 0.0%
 
8 6 0.0%
 
12 1 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 23338 81.7%
 
1 3232 11.3%
 
2 1425 5.0%
 
3 366 1.3%
 
4 138 0.5%
 

Maximum 5 values

Value Count Frequency (%)  
7 12 0.0%
 
8 6 0.0%
 
9 1 0.0%
 
10 1 0.0%
 
12 1 0.0%
 

MTM_contract_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.0764
Minimum 0
Maximum 16
Zeros (%) 21.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 1
95-th percentile 3
Maximum 16
Range 16
Interquartile range 0

Descriptive statistics

Standard deviation 0.87047
Coef of variation 0.80867
Kurtosis 9.3434
Mean 1.0764
MAD 0.54139
Skewness 1.8619
Sum 30762
Variance 0.75771
Memory size 223.3 KiB
Value Count Frequency (%)  
1 16786 58.7%
 
0 5995 21.0%
 
2 4315 15.1%
 
3 900 3.1%
 
4 382 1.3%
 
5 128 0.4%
 
6 47 0.2%
 
7 14 0.0%
 
8 8 0.0%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 5995 21.0%
 
1 16786 58.7%
 
2 4315 15.1%
 
3 900 3.1%
 
4 382 1.3%
 

Maximum 5 values

Value Count Frequency (%)  
6 47 0.2%
 
7 14 0.0%
 
8 8 0.0%
 
9 2 0.0%
 
16 1 0.0%
 

MatureAdults_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.39278
Minimum 0
Maximum 11
Zeros (%) 77.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 11
Range 11
Interquartile range 0

Descriptive statistics

Standard deviation 0.85278
Coef of variation 2.1711
Kurtosis 9.4321
Mean 0.39278
MAD 0.60755
Skewness 2.6967
Sum 11225
Variance 0.72723
Memory size 223.3 KiB
Value Count Frequency (%)  
0 22102 77.3%
 
1 3191 11.2%
 
2 2298 8.0%
 
3 690 2.4%
 
4 187 0.7%
 
5 66 0.2%
 
6 28 0.1%
 
7 10 0.0%
 
8 4 0.0%
 
11 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 22102 77.3%
 
1 3191 11.2%
 
2 2298 8.0%
 
3 690 2.4%
 
4 187 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
6 28 0.1%
 
7 10 0.0%
 
8 4 0.0%
 
9 1 0.0%
 
11 1 0.0%
 

NewVZPlan_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.85566
Minimum 0
Maximum 11
Zeros (%) 45.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 1
95-th percentile 3
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 0.98719
Coef of variation 1.1537
Kurtosis 3.6624
Mean 0.85566
MAD 0.77015
Skewness 1.4212
Sum 24453
Variance 0.97455
Memory size 223.3 KiB
Value Count Frequency (%)  
0 12861 45.0%
 
1 9345 32.7%
 
2 4675 16.4%
 
3 1256 4.4%
 
4 310 1.1%
 
5 74 0.3%
 
6 37 0.1%
 
7 11 0.0%
 
8 5 0.0%
 
11 2 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 12861 45.0%
 
1 9345 32.7%
 
2 4675 16.4%
 
3 1256 4.4%
 
4 310 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 11 0.0%
 
8 5 0.0%
 
9 1 0.0%
 
10 1 0.0%
 
11 2 0.0%
 

Purch_equip_cnt
Numeric

Distinct count 9
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.47533
Minimum 0
Maximum 10
Zeros (%) 65.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.77475
Coef of variation 1.6299
Kurtosis 6.4845
Mean 0.47533
MAD 0.6211
Skewness 2.0855
Sum 13584
Variance 0.60024
Memory size 223.3 KiB
Value Count Frequency (%)  
0 18671 65.3%
 
1 7173 25.1%
 
2 2055 7.2%
 
3 498 1.7%
 
4 128 0.4%
 
5 33 0.1%
 
6 13 0.0%
 
7 6 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 18671 65.3%
 
1 7173 25.1%
 
2 2055 7.2%
 
3 498 1.7%
 
4 128 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
4 128 0.4%
 
5 33 0.1%
 
6 13 0.0%
 
7 6 0.0%
 
10 1 0.0%
 

SingleLine_plan_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.028693
Minimum 0
Maximum 4
Zeros (%) 97.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.19695
Coef of variation 6.8639
Kurtosis 86.175
Mean 0.028693
MAD 0.056005
Skewness 8.346
Sum 820
Variance 0.038789
Memory size 223.3 KiB
Value Count Frequency (%)  
0 27890 97.6%
 
1 577 2.0%
 
2 93 0.3%
 
3 15 0.1%
 
4 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 27890 97.6%
 
1 577 2.0%
 
2 93 0.3%
 
3 15 0.1%
 
4 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 27890 97.6%
 
1 577 2.0%
 
2 93 0.3%
 
3 15 0.1%
 
4 3 0.0%
 

SngleCpleKids_cnt
Numeric

Distinct count 13
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.63195
Minimum 0
Maximum 12
Zeros (%) 61.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 12
Range 12
Interquartile range 1

Descriptive statistics

Standard deviation 1.0024
Coef of variation 1.5862
Kurtosis 7.6521
Mean 0.63195
MAD 0.7718
Skewness 2.2415
Sum 18060
Variance 1.0048
Memory size 223.3 KiB
Value Count Frequency (%)  
0 17451 61.1%
 
1 6625 23.2%
 
2 3043 10.6%
 
3 893 3.1%
 
4 325 1.1%
 
5 147 0.5%
 
6 54 0.2%
 
7 24 0.1%
 
8 7 0.0%
 
9 6 0.0%
 
Other values (3) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17451 61.1%
 
1 6625 23.2%
 
2 3043 10.6%
 
3 893 3.1%
 
4 325 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
8 7 0.0%
 
9 6 0.0%
 
10 1 0.0%
 
11 1 0.0%
 
12 1 0.0%
 

Sprint_port_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.14378
Minimum 0
Maximum 7
Zeros (%) 88.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.44583
Coef of variation 3.1008
Kurtosis 23.307
Mean 0.14378
MAD 0.25423
Skewness 4.1046
Sum 4109
Variance 0.19877
Memory size 223.3 KiB
Value Count Frequency (%)  
0 25265 88.4%
 
1 2719 9.5%
 
2 459 1.6%
 
3 80 0.3%
 
4 46 0.2%
 
5 7 0.0%
 
7 1 0.0%
 
6 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 25265 88.4%
 
1 2719 9.5%
 
2 459 1.6%
 
3 80 0.3%
 
4 46 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
3 80 0.3%
 
4 46 0.2%
 
5 7 0.0%
 
6 1 0.0%
 
7 1 0.0%
 

TX_state_cd
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.44849
Minimum 0
Maximum 19
Zeros (%) 73.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 19
Range 19
Interquartile range 1

Descriptive statistics

Standard deviation 0.91877
Coef of variation 2.0486
Kurtosis 14.527
Mean 0.44849
MAD 0.66252
Skewness 2.8764
Sum 12817
Variance 0.84413
Memory size 223.3 KiB
Value Count Frequency (%)  
0 21108 73.9%
 
1 4028 14.1%
 
2 2266 7.9%
 
3 727 2.5%
 
4 287 1.0%
 
5 96 0.3%
 
6 41 0.1%
 
7 12 0.0%
 
8 9 0.0%
 
9 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 21108 73.9%
 
1 4028 14.1%
 
2 2266 7.9%
 
3 727 2.5%
 
4 287 1.0%
 

Maximum 5 values

Value Count Frequency (%)  
6 41 0.1%
 
7 12 0.0%
 
8 9 0.0%
 
9 3 0.0%
 
19 1 0.0%
 

T_Mobile_port_cnt
Numeric

Distinct count 10
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.17555
Minimum 0
Maximum 18
Zeros (%) 87.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 18
Range 18
Interquartile range 0

Descriptive statistics

Standard deviation 0.54404
Coef of variation 3.099
Kurtosis 61.52
Mean 0.17555
MAD 0.30674
Skewness 5.1441
Sum 5017
Variance 0.29598
Memory size 223.3 KiB
Value Count Frequency (%)  
0 24967 87.4%
 
1 2635 9.2%
 
2 711 2.5%
 
3 153 0.5%
 
4 84 0.3%
 
5 18 0.1%
 
6 7 0.0%
 
8 1 0.0%
 
7 1 0.0%
 
18 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 24967 87.4%
 
1 2635 9.2%
 
2 711 2.5%
 
3 153 0.5%
 
4 84 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
5 18 0.1%
 
6 7 0.0%
 
7 1 0.0%
 
8 1 0.0%
 
18 1 0.0%
 

TwoYear_contract_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.21772
Minimum 0
Maximum 10
Zeros (%) 81.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.50192
Coef of variation 2.3054
Kurtosis 13.675
Mean 0.21772
MAD 0.35479
Skewness 2.868
Sum 6222
Variance 0.25193
Memory size 223.3 KiB
Value Count Frequency (%)  
0 23285 81.5%
 
1 4540 15.9%
 
2 614 2.1%
 
3 111 0.4%
 
4 24 0.1%
 
5 3 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 23285 81.5%
 
1 4540 15.9%
 
2 614 2.1%
 
3 111 0.4%
 
4 24 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
2 614 2.1%
 
3 111 0.4%
 
4 24 0.1%
 
5 3 0.0%
 
10 1 0.0%
 

VerizonUNL_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.46522
Minimum 0
Maximum 19
Zeros (%) 69.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 19
Range 19
Interquartile range 1

Descriptive statistics

Standard deviation 0.88571
Coef of variation 1.9039
Kurtosis 18.995
Mean 0.46522
MAD 0.64302
Skewness 3.176
Sum 13295
Variance 0.78447
Memory size 223.3 KiB
Value Count Frequency (%)  
0 19750 69.1%
 
1 6060 21.2%
 
2 1839 6.4%
 
3 526 1.8%
 
4 201 0.7%
 
5 114 0.4%
 
6 46 0.2%
 
7 25 0.1%
 
8 10 0.0%
 
9 5 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 19750 69.1%
 
1 6060 21.2%
 
2 1839 6.4%
 
3 526 1.8%
 
4 201 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
7 25 0.1%
 
8 10 0.0%
 
9 5 0.0%
 
10 1 0.0%
 
19 1 0.0%
 

VoiceOnly_pplangrp_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.026839
Minimum 0
Maximum 4
Zeros (%) 97.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.19172
Coef of variation 7.1435
Kurtosis 87.922
Mean 0.026839
MAD 0.052487
Skewness 8.5407
Sum 767
Variance 0.036757
Memory size 223.3 KiB
Value Count Frequency (%)  
0 27944 97.8%
 
1 518 1.8%
 
2 101 0.4%
 
3 13 0.0%
 
4 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 27944 97.8%
 
1 518 1.8%
 
2 101 0.4%
 
3 13 0.0%
 
4 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 27944 97.8%
 
1 518 1.8%
 
2 101 0.4%
 
3 13 0.0%
 
4 2 0.0%
 

WearablDev_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.075093
Minimum 0
Maximum 4
Zeros (%) 93.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.30215
Coef of variation 4.0236
Kurtosis 27.067
Mean 0.075093
MAD 0.14032
Skewness 4.7021
Sum 2146
Variance 0.091292
Memory size 223.3 KiB
Value Count Frequency (%)  
0 26701 93.4%
 
1 1646 5.8%
 
2 198 0.7%
 
3 28 0.1%
 
4 5 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 26701 93.4%
 
1 1646 5.8%
 
2 198 0.7%
 
3 28 0.1%
 
4 5 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 26701 93.4%
 
1 1646 5.8%
 
2 198 0.7%
 
3 28 0.1%
 
4 5 0.0%
 

access_rev_acrtn
Numeric

Distinct count 1734
Unique (%) 6.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -0.94655
Minimum -99.29
Maximum 23.97
Zeros (%) 45.9%

Quantile statistics

Minimum -99.29
5-th percentile -5.39
Q1 -1.39
Median 0
Q3 0
95-th percentile 1.67
Maximum 23.97
Range 123.26
Interquartile range 1.39

Descriptive statistics

Standard deviation 3.2476
Coef of variation -3.431
Kurtosis 110.23
Mean -0.94655
MAD 1.7035
Skewness -6.772
Sum -27050
Variance 10.547
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 13125 45.9%
 
-1.67 215 0.8%
 
-2.5 189 0.7%
 
-1.82 187 0.7%
 
-1.74 176 0.6%
 
-2.0 158 0.6%
 
-1.9 149 0.5%
 
-1.6 127 0.4%
 
-1.3 118 0.4%
 
-1.25 112 0.4%
 
Other values (1724) 14022 49.1%
 

Minimum 5 values

Value Count Frequency (%)  
-99.29 1 0.0%
 
-87.1 1 0.0%
 
-84.21 1 0.0%
 
-82.5 1 0.0%
 
-58.42 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
11.12 1 0.0%
 
11.6 1 0.0%
 
19.73 1 0.0%
 
22.0 1 0.0%
 
23.97 1 0.0%
 

active_loan_cnt
Numeric

Distinct count 41
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.3847
Minimum 0
Maximum 144
Zeros (%) 13.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 2
95-th percentile 8
Maximum 144
Range 144
Interquartile range 1

Descriptive statistics

Standard deviation 3.9114
Coef of variation 1.6402
Kurtosis 107.62
Mean 2.3847
MAD 2.0731
Skewness 7.0059
Sum 68151
Variance 15.299
Memory size 223.3 KiB
Value Count Frequency (%)  
1 13597 47.6%
 
2 4024 14.1%
 
0 3877 13.6%
 
4 3646 12.8%
 
6 957 3.3%
 
3 783 2.7%
 
9 516 1.8%
 
8 261 0.9%
 
16 202 0.7%
 
12 194 0.7%
 
Other values (31) 521 1.8%
 

Minimum 5 values

Value Count Frequency (%)  
0 3877 13.6%
 
1 13597 47.6%
 
2 4024 14.1%
 
3 783 2.7%
 
4 3646 12.8%
 

Maximum 5 values

Value Count Frequency (%)  
64 2 0.0%
 
66 1 0.0%
 
72 1 0.0%
 
81 1 0.0%
 
144 1 0.0%
 

arpu_amt
Numeric

Distinct count 11678
Unique (%) 40.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 128.89
Minimum 55.17
Maximum 1627.1
Zeros (%) 0.0%

Quantile statistics

Minimum 55.17
5-th percentile 77.638
Q1 100.17
Median 121.03
Q3 141.79
95-th percentile 207.15
Maximum 1627.1
Range 1572
Interquartile range 41.62

Descriptive statistics

Standard deviation 53.824
Coef of variation 0.41759
Kurtosis 89.759
Mean 128.89
MAD 31.187
Skewness 6.102
Sum 3683400
Variance 2897
Memory size 223.3 KiB
Value Count Frequency (%)  
110.84 13 0.0%
 
118.3 13 0.0%
 
108.96 10 0.0%
 
132.24 10 0.0%
 
105.24 10 0.0%
 
129.13 10 0.0%
 
122.58 10 0.0%
 
100.98 10 0.0%
 
123.97 10 0.0%
 
107.26 10 0.0%
 
Other values (11668) 28472 99.6%
 

Minimum 5 values

Value Count Frequency (%)  
55.17 1 0.0%
 
55.42 1 0.0%
 
55.64 1 0.0%
 
55.9 1 0.0%
 
56.01 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1148.49 1 0.0%
 
1244.26 1 0.0%
 
1413.26 1 0.0%
 
1469.17 1 0.0%
 
1627.14 1 0.0%
 

billed_acct_size
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.6474
Minimum 0
Maximum 10
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 1
Median 1
Q3 2
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.95216
Coef of variation 0.57799
Kurtosis 7.1538
Mean 1.6474
MAD 0.73135
Skewness 2.1754
Sum 47078
Variance 0.9066
Memory size 223.3 KiB
Value Count Frequency (%)  
1 16133 56.5%
 
2 8600 30.1%
 
3 2486 8.7%
 
4 828 2.9%
 
5 317 1.1%
 
6 126 0.4%
 
7 54 0.2%
 
8 18 0.1%
 
9 8 0.0%
 
10 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 4 0.0%
 
1 16133 56.5%
 
2 8600 30.1%
 
3 2486 8.7%
 
4 828 2.9%
 

Maximum 5 values

Value Count Frequency (%)  
6 126 0.4%
 
7 54 0.2%
 
8 18 0.1%
 
9 8 0.0%
 
10 4 0.0%
 

birth_year
Numeric

Distinct count 87
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1975
Minimum 0
Maximum 2000
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1945
Q1 1963
Median 1980
Q3 1990
95-th percentile 1997
Maximum 2000
Range 2000
Interquartile range 27

Descriptive statistics

Standard deviation 40.536
Coef of variation 0.020525
Kurtosis 1969.4
Mean 1975
MAD 14.921
Skewness -40.511
Sum 56441073
Variance 1643.1
Memory size 223.3 KiB
Value Count Frequency (%)  
1991 918 3.2%
 
1990 915 3.2%
 
1992 899 3.1%
 
1993 885 3.1%
 
1989 864 3.0%
 
1987 842 2.9%
 
1994 812 2.8%
 
1988 806 2.8%
 
1986 771 2.7%
 
1985 751 2.6%
 
Other values (77) 20115 70.4%
 

Minimum 5 values

Value Count Frequency (%)  
0 10 0.0%
 
1910 1 0.0%
 
1911 4 0.0%
 
1917 1 0.0%
 
1918 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1996 635 2.2%
 
1997 629 2.2%
 
1998 553 1.9%
 
1999 309 1.1%
 
2000 2 0.0%
 

cust_status_ind
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.8326
1
23794
0
 
4784
Value Count Frequency (%)  
1 23794 83.3%
 
0 4784 16.7%
 

data_overage_amt
Numeric

Distinct count 801
Unique (%) 2.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.93202
Minimum 0
Maximum 375
Zeros (%) 78.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 5
Maximum 375
Range 375
Interquartile range 0

Descriptive statistics

Standard deviation 4.7323
Coef of variation 5.0775
Kurtosis 1770
Mean 0.93202
MAD 1.5013
Skewness 29.96
Sum 26635
Variance 22.395
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 22352 78.2%
 
0.62 155 0.5%
 
0.6 152 0.5%
 
0.75 150 0.5%
 
0.65 143 0.5%
 
0.71 126 0.4%
 
0.58 122 0.4%
 
0.68 119 0.4%
 
0.56 119 0.4%
 
0.52 115 0.4%
 
Other values (791) 5025 17.6%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 22352 78.2%
 
0.01 16 0.1%
 
0.02 8 0.0%
 
0.03 5 0.0%
 
0.04 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
118.33 1 0.0%
 
131.0 1 0.0%
 
182.14 1 0.0%
 
240.0 1 0.0%
 
375.0 1 0.0%
 

data_ovrg_Yes_cnt
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.23564
Minimum 0
Maximum 5
Zeros (%) 78.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 5
Range 5
Interquartile range 0

Descriptive statistics

Standard deviation 0.46516
Coef of variation 1.974
Kurtosis 3.7476
Mean 0.23564
MAD 0.36811
Skewness 1.8829
Sum 6734
Variance 0.21637
Memory size 223.3 KiB
Value Count Frequency (%)  
0 22322 78.1%
 
1 5808 20.3%
 
2 425 1.5%
 
3 19 0.1%
 
5 3 0.0%
 
4 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 22322 78.1%
 
1 5808 20.3%
 
2 425 1.5%
 
3 19 0.1%
 
4 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 5808 20.3%
 
2 425 1.5%
 
3 19 0.1%
 
4 1 0.0%
 
5 3 0.0%
 

eqp_rev_acrtn
Numeric

Distinct count 2172
Unique (%) 7.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -1.2218
Minimum -1234.5
Maximum 1196
Zeros (%) 23.3%

Quantile statistics

Minimum -1234.5
5-th percentile -6.09
Q1 -1.16
Median -0.01
Q3 0
95-th percentile 1.54
Maximum 1196
Range 2430.5
Interquartile range 1.16

Descriptive statistics

Standard deviation 12.845
Coef of variation -10.513
Kurtosis 5776.2
Mean -1.2218
MAD 2.3067
Skewness -7.0034
Sum -34917
Variance 165
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 6647 23.3%
 
-0.01 1828 6.4%
 
-0.02 181 0.6%
 
-0.63 167 0.6%
 
-1.26 166 0.6%
 
-1.09 137 0.5%
 
-0.9 136 0.5%
 
-1.3 135 0.5%
 
-1.04 131 0.5%
 
-1.35 127 0.4%
 
Other values (2162) 18923 66.2%
 

Minimum 5 values

Value Count Frequency (%)  
-1234.53 1 0.0%
 
-564.31 1 0.0%
 
-265.5 1 0.0%
 
-212.67 1 0.0%
 
-190.54 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
75.55 1 0.0%
 
165.49 1 0.0%
 
207.7 1 0.0%
 
270.5 1 0.0%
 
1196.0 1 0.0%
 

frequency
Numeric

Distinct count 33
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 23.322
Minimum 2
Maximum 51
Zeros (%) 0.0%

Quantile statistics

Minimum 2
5-th percentile 13
Q1 21
Median 24
Q3 27
95-th percentile 30
Maximum 51
Range 49
Interquartile range 6

Descriptive statistics

Standard deviation 5.0344
Coef of variation 0.21586
Kurtosis 1.2174
Mean 23.322
MAD 3.8635
Skewness -0.91017
Sum 666506
Variance 25.345
Memory size 223.3 KiB
Value Count Frequency (%)  
21 2906 10.2%
 
20 2697 9.4%
 
22 2417 8.5%
 
23 2281 8.0%
 
24 2251 7.9%
 
25 2131 7.5%
 
26 1937 6.8%
 
27 1905 6.7%
 
28 1779 6.2%
 
29 1759 6.2%
 
Other values (23) 6515 22.8%
 

Minimum 5 values

Value Count Frequency (%)  
2 6 0.0%
 
3 12 0.0%
 
4 30 0.1%
 
5 43 0.2%
 
6 49 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
30 1526 5.3%
 
31 1220 4.3%
 
32 1 0.0%
 
34 1 0.0%
 
51 1 0.0%
 

gl_late_fees_amt
Numeric

Distinct count 1783
Unique (%) 6.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 23.276
Minimum -25
Maximum 158.29
Zeros (%) 34.6%

Quantile statistics

Minimum -25
5-th percentile 0
Q1 0
Median 5
Q3 39.758
95-th percentile 95
Maximum 158.29
Range 183.29
Interquartile range 39.758

Descriptive statistics

Standard deviation 31.337
Coef of variation 1.3463
Kurtosis 1.1951
Mean 23.276
MAD 25.086
Skewness 1.4538
Sum 665190
Variance 982.03
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 9879 34.6%
 
5.0 4571 16.0%
 
10.0 2045 7.2%
 
15.0 1287 4.5%
 
20.0 908 3.2%
 
25.0 733 2.6%
 
30.0 638 2.2%
 
35.0 557 1.9%
 
45.0 552 1.9%
 
40.0 534 1.9%
 
Other values (1773) 6874 24.1%
 

Minimum 5 values

Value Count Frequency (%)  
-25.0 1 0.0%
 
-20.0 1 0.0%
 
-10.0 1 0.0%
 
-5.95 1 0.0%
 
-5.0 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
147.92 1 0.0%
 
149.07 1 0.0%
 
150.0 3 0.0%
 
155.0 1 0.0%
 
158.29 1 0.0%
 

gl_nonrecur_rev_amt
Numeric

Distinct count 8748
Unique (%) 30.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 79.67
Minimum -955.59
Maximum 6893.2
Zeros (%) 15.4%

Quantile statistics

Minimum -955.59
5-th percentile 0
Q1 5
Median 30.705
Q3 105
95-th percentile 328.33
Maximum 6893.2
Range 7848.8
Interquartile range 100

Descriptive statistics

Standard deviation 147.75
Coef of variation 1.8545
Kurtosis 193.53
Mean 79.67
MAD 87.675
Skewness 7.2609
Sum 2276800
Variance 21831
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 4400 15.4%
 
5.0 1819 6.4%
 
15.0 878 3.1%
 
10.0 734 2.6%
 
30.0 511 1.8%
 
20.0 480 1.7%
 
25.0 377 1.3%
 
45.0 356 1.2%
 
35.0 317 1.1%
 
1.99 314 1.1%
 
Other values (8738) 18392 64.4%
 

Minimum 5 values

Value Count Frequency (%)  
-955.59 1 0.0%
 
-849.0 1 0.0%
 
-787.86 1 0.0%
 
-755.79 1 0.0%
 
-715.0 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2652.27 1 0.0%
 
2748.31 1 0.0%
 
2827.88 1 0.0%
 
2850.75 1 0.0%
 
6893.17 1 0.0%
 

handset_paychnl_cnt
Numeric

Distinct count 12
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.73515
Minimum 0
Maximum 16
Zeros (%) 51.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 16
Range 16
Interquartile range 1

Descriptive statistics

Standard deviation 0.97608
Coef of variation 1.3277
Kurtosis 7.5324
Mean 0.73515
MAD 0.75732
Skewness 1.964
Sum 21009
Variance 0.95274
Memory size 223.3 KiB
Value Count Frequency (%)  
0 14720 51.5%
 
1 8961 31.4%
 
2 3491 12.2%
 
3 884 3.1%
 
4 322 1.1%
 
5 129 0.5%
 
6 42 0.1%
 
7 17 0.1%
 
8 7 0.0%
 
9 3 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14720 51.5%
 
1 8961 31.4%
 
2 3491 12.2%
 
3 884 3.1%
 
4 322 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 17 0.1%
 
8 7 0.0%
 
9 3 0.0%
 
11 1 0.0%
 
16 1 0.0%
 

index
Numeric

Distinct count 28578
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 382960
Minimum 8
Maximum 758185
Zeros (%) 0.0%

Quantile statistics

Minimum 8
5-th percentile 40090
Q1 194810
Median 385500
Q3 572820
95-th percentile 721180
Maximum 758185
Range 758177
Interquartile range 378010

Descriptive statistics

Standard deviation 218450
Coef of variation 0.57044
Kurtosis -1.1973
Mean 382960
MAD 189070
Skewness -0.026886
Sum 10944093801
Variance 47722000000
Memory size 223.3 KiB
Value Count Frequency (%)  
1 198655 695.1%
 
1 376477 1317.4%
 
1 683655 2392.2%
 
1 1674 5.9%
 
1 16013 56.0%
 
1 723817 2532.8%
 
1 732815 2564.3%
 
1 253585 887.3%
 
1 116370 407.2%
 
1 118419 414.4%
 
Other values (28568) 10940872321 38284247.7%
 

Minimum 5 values

Value Count Frequency (%)  
1 198655 695.1%
 
1 376477 1317.4%
 
1 683655 2392.2%
 
1 1674 5.9%
 
1 16013 56.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 541629 1895.3%
 
1 154558 540.8%
 
1 676799 2368.3%
 
1 244674 856.2%
 
1 137219 480.2%
 

monetary
Numeric

Distinct count 26738
Unique (%) 93.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2824.5
Minimum 1706.5
Maximum 3914.8
Zeros (%) 0.0%

Quantile statistics

Minimum 1706.5
5-th percentile 1872.4
Q1 2336.6
Median 2824
Q3 3315
95-th percentile 3773.9
Maximum 3914.8
Range 2208.3
Interquartile range 978.37

Descriptive statistics

Standard deviation 594
Coef of variation 0.2103
Kurtosis -1.0741
Mean 2824.5
MAD 507.21
Skewness -0.010456
Sum 80719000
Variance 352840
Memory size 223.3 KiB
Value Count Frequency (%)  
1970.38 4 0.0%
 
2888.51 3 0.0%
 
2423.8 3 0.0%
 
3865.15 3 0.0%
 
2237.13 3 0.0%
 
2902.81 3 0.0%
 
3478.4 3 0.0%
 
3074.56 3 0.0%
 
2388.41 3 0.0%
 
2607.95 3 0.0%
 
Other values (26728) 28547 99.9%
 

Minimum 5 values

Value Count Frequency (%)  
1706.48 1 0.0%
 
1709.46 1 0.0%
 
1710.22 1 0.0%
 
1710.79 1 0.0%
 
1711.13 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
3913.82 1 0.0%
 
3914.37 1 0.0%
 
3914.58 1 0.0%
 
3914.62 1 0.0%
 
3914.77 1 0.0%
 

pplan_access_acrtn
Numeric

Distinct count 766
Unique (%) 2.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.3915
Minimum -3.6
Maximum 259.29
Zeros (%) 37.4%

Quantile statistics

Minimum -3.6
5-th percentile 0
Q1 0
Median 1.61
Q3 2.95
95-th percentile 10.77
Maximum 259.29
Range 262.89
Interquartile range 2.95

Descriptive statistics

Standard deviation 9.8671
Coef of variation 2.9093
Kurtosis 154.71
Mean 3.3915
MAD 3.625
Skewness 10.382
Sum 96923
Variance 97.359
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 10678 37.4%
 
2.5 715 2.5%
 
2.41 660 2.3%
 
2.6 645 2.3%
 
2.32 600 2.1%
 
2.24 553 1.9%
 
2.71 478 1.7%
 
2.17 434 1.5%
 
1.67 428 1.5%
 
1.38 336 1.2%
 
Other values (756) 13051 45.7%
 

Minimum 5 values

Value Count Frequency (%)  
-3.6 1 0.0%
 
-2.33 1 0.0%
 
-2.0 1 0.0%
 
-1.9 1 0.0%
 
-1.67 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
204.29 1 0.0%
 
210.0 1 0.0%
 
220.0 4 0.0%
 
225.0 1 0.0%
 
259.29 1 0.0%
 

recency
Numeric

Distinct count 30
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 102.23
Minimum 51
Maximum 931
Zeros (%) 0.0%

Quantile statistics

Minimum 51
5-th percentile 51
Q1 51
Median 51
Q3 51
95-th percentile 416
Maximum 931
Range 880
Interquartile range 0

Descriptive statistics

Standard deviation 128.6
Coef of variation 1.2579
Kurtosis 7.1069
Mean 102.23
MAD 83.728
Skewness 2.7416
Sum 2921609
Variance 16537
Memory size 223.3 KiB
Value Count Frequency (%)  
51 23197 81.2%
 
232 404 1.4%
 
81 374 1.3%
 
263 362 1.3%
 
201 352 1.2%
 
142 327 1.1%
 
293 320 1.1%
 
354 319 1.1%
 
173 308 1.1%
 
385 307 1.1%
 
Other values (20) 2308 8.1%
 

Minimum 5 values

Value Count Frequency (%)  
51 23197 81.2%
 
81 374 1.3%
 
112 301 1.1%
 
142 327 1.1%
 
173 308 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
811 3 0.0%
 
842 3 0.0%
 
872 4 0.0%
 
903 4 0.0%
 
931 2 0.0%
 

recurring_paychnl_cnt
Numeric

Distinct count 11
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.0412
Minimum 0
Maximum 10
Zeros (%) 32.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 2
95-th percentile 3
Maximum 10
Range 10
Interquartile range 2

Descriptive statistics

Standard deviation 0.99076
Coef of variation 0.95154
Kurtosis 3.5906
Mean 1.0412
MAD 0.70038
Skewness 1.3234
Sum 29756
Variance 0.98161
Memory size 223.3 KiB
Value Count Frequency (%)  
1 12038 42.1%
 
0 9135 32.0%
 
2 5431 19.0%
 
3 1347 4.7%
 
4 429 1.5%
 
5 130 0.5%
 
6 44 0.2%
 
7 15 0.1%
 
9 4 0.0%
 
8 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 9135 32.0%
 
1 12038 42.1%
 
2 5431 19.0%
 
3 1347 4.7%
 
4 429 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
6 44 0.2%
 
7 15 0.1%
 
8 3 0.0%
 
9 4 0.0%
 
10 2 0.0%
 

tot_revenue_acrtn
Numeric

Distinct count 4078
Unique (%) 14.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -5.5583
Minimum -1342
Maximum 1190.2
Zeros (%) 0.2%

Quantile statistics

Minimum -1342
5-th percentile -24.283
Q1 -5.61
Median -2.02
Q3 -0.47
95-th percentile 1.93
Maximum 1190.2
Range 2532.2
Interquartile range 5.14

Descriptive statistics

Standard deviation 20.964
Coef of variation -3.7716
Kurtosis 1396
Mean -5.5583
MAD 7.0157
Skewness 0.43079
Sum -158850
Variance 439.48
Memory size 223.3 KiB
Value Count Frequency (%)  
0.01 199 0.7%
 
-1.57 106 0.4%
 
-1.45 97 0.3%
 
-1.38 90 0.3%
 
0.02 81 0.3%
 
-1.65 76 0.3%
 
-1.2 74 0.3%
 
-1.32 73 0.3%
 
-1.5 71 0.2%
 
-1.21 70 0.2%
 
Other values (4068) 27641 96.7%
 

Minimum 5 values

Value Count Frequency (%)  
-1341.96 1 0.0%
 
-779.22 1 0.0%
 
-343.53 1 0.0%
 
-341.12 1 0.0%
 
-334.27 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
273.61 1 0.0%
 
435.0 1 0.0%
 
813.66 1 0.0%
 
1094.1 1 0.0%
 
1190.2 1 0.0%
 

usage_rev_acrtn
Numeric

Distinct count 505
Unique (%) 1.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.13779
Minimum -93.29
Maximum 1008.6
Zeros (%) 91.3%

Quantile statistics

Minimum -93.29
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.22
Maximum 1008.6
Range 1101.9
Interquartile range 0

Descriptive statistics

Standard deviation 7.8918
Coef of variation 57.274
Kurtosis 12386
Mean 0.13779
MAD 0.31338
Skewness 106.45
Sum 3937.8
Variance 62.28
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 26088 91.3%
 
0.09 95 0.3%
 
0.07 89 0.3%
 
0.08 79 0.3%
 
0.5 51 0.2%
 
0.1 51 0.2%
 
0.01 46 0.2%
 
0.12 42 0.1%
 
0.48 40 0.1%
 
0.75 38 0.1%
 
Other values (495) 1959 6.9%
 

Minimum 5 values

Value Count Frequency (%)  
-93.29 1 0.0%
 
-52.12 1 0.0%
 
-35.85 1 0.0%
 
-26.98 1 0.0%
 
-21.75 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
190.9 1 0.0%
 
227.56 1 0.0%
 
245.0 1 0.0%
 
758.62 1 0.0%
 
1008.59 1 0.0%
 

voice_overage_amt
Numeric

Distinct count 602
Unique (%) 2.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.32765
Minimum 0
Maximum 772.2
Zeros (%) 91.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.33
Maximum 772.2
Range 772.2
Interquartile range 0

Descriptive statistics

Standard deviation 8.5909
Coef of variation 26.22
Kurtosis 4943.4
Mean 0.32765
MAD 0.61301
Skewness 66.2
Sum 9363.5
Variance 73.803
Memory size 223.3 KiB
Value Count Frequency (%)  
0.0 26208 91.7%
 
0.07 97 0.3%
 
0.09 85 0.3%
 
0.06 66 0.2%
 
0.08 54 0.2%
 
0.14 43 0.2%
 
0.12 40 0.1%
 
0.04 40 0.1%
 
0.19 36 0.1%
 
0.16 35 0.1%
 
Other values (592) 1874 6.6%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 26208 91.7%
 
0.01 2 0.0%
 
0.02 2 0.0%
 
0.03 18 0.1%
 
0.04 40 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
343.67 1 0.0%
 
552.4 1 0.0%
 
560.72 1 0.0%
 
664.98 1 0.0%
 
772.2 1 0.0%
 

Correlations

Sample

cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
56841 1 51 27 2164.37 1 0 1 1 1952 0 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 0 80.16 5.0 6.99 1.30 -1.61 -0.29 0.00 0.00 0.00 0.00 0
237065 1 51 27 1828.20 2 0 1 2 1932 0 0 0 0 0 0 0 2 2 2 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 67.71 0.0 0.00 1.48 -3.27 0.00 -0.78 0.00 0.00 0.00 0
62699 1 51 24 2805.00 1 1 1 1 1991 0 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0 0 0 116.88 5.0 -46.28 1.67 -0.10 -1.04 2.00 0.00 1.25 0.00 0
486254 1 51 29 3088.69 1 0 1 1 1962 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 106.51 70.0 370.00 0.00 0.76 -1.21 -0.10 0.00 10.34 0.00 0
104869 1 51 20 3415.30 4 0 4 4 1983 0 0 4 4 0 0 1 1 4 0 0 0 0 4 2 0 0 0 4 0 3 4 0 0 0 0 0 170.76 0.0 62.40 14.00 -1.30 1.14 8.20 0.75 1.50 0.81 0

Cluster 0 - Silver Findings:

1. Recency - Low (50 - 750)
2. Frequency - Low to Medium High (2 to 28)
3. Monetary - Low to High 
4. VoiceTextData_pplangrp_cnt - 1 to 5
5. Customer usually take Loans
6. IphoneDev_cnt - 67.6%
7. Total Revenue Acrtn - Negative
8. Some customers have voice overage
9. Customers mostly from TX
10. ARPU Amount - 128.36 (Mean)
11. billed_acct_size - 1.6 (Mean)
12. Birth Year - 1975 (Mean)
13. % of AT&T port in - 20.1%
14. % of Sprint_port in - 11.3%
15. % of T mobile port in - 12.5%
16. % of Android Devices - 42.2%
17. CA users - 49.3%
18. TX users - 26%
19. CommunicStore_chnl_cnt - 45.7%
20. Edge_contract_cnt - 85.8%
21. NewVZPlan_plan_cnt - 54.6%
22. SingleLine_plan_cnt - 2.4%
23. data_ovrg_Yes_cnt - 22%
24. tot_revenue_acrtn - -5.3379 (Mean)

Cluster 0:

In [42]:
df[df['Customer_Cluster']== 0].head(10)
Out[42]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
56841 1 51 27 2164.37 1 0 1 1 1952 0 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 0 80.16 5.0 6.99 1.30 -1.61 -0.29 0.00 0.00 0.00 0.00 0
237065 1 51 27 1828.20 2 0 1 2 1932 0 0 0 0 0 0 0 2 2 2 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 67.71 0.0 0.00 1.48 -3.27 0.00 -0.78 0.00 0.00 0.00 0
62699 1 51 24 2805.00 1 1 1 1 1991 0 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0 0 0 116.88 5.0 -46.28 1.67 -0.10 -1.04 2.00 0.00 1.25 0.00 0
486254 1 51 29 3088.69 1 0 1 1 1962 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 106.51 70.0 370.00 0.00 0.76 -1.21 -0.10 0.00 10.34 0.00 0
104869 1 51 20 3415.30 4 0 4 4 1983 0 0 4 4 0 0 1 1 4 0 0 0 0 4 2 0 0 0 4 0 3 4 0 0 0 0 0 170.76 0.0 62.40 14.00 -1.30 1.14 8.20 0.75 1.50 0.81 0
586227 0 201 21 1861.50 3 0 3 0 1973 0 0 0 0 3 3 0 0 0 2 0 0 0 0 1 0 0 0 3 0 1 2 0 0 3 1 0 88.64 15.0 15.00 5.00 -9.95 -1.57 0.00 0.00 0.00 0.00 0
43505 0 477 13 2863.65 0 2 2 4 1976 0 0 0 2 0 0 0 2 0 2 0 0 0 0 0 0 2 0 2 2 0 2 0 2 2 0 0 220.28 0.0 0.00 16.92 -5.58 0.00 -0.02 0.00 0.00 0.00 0
427692 1 51 28 2792.51 0 1 1 1 1987 0 0 1 1 0 0 0 1 1 0 0 0 0 1 0 0 1 0 1 1 0 1 0 0 0 0 0 99.73 0.0 -248.63 2.32 1.20 0.00 0.83 0.00 0.00 0.00 0
561895 1 51 22 2789.65 0 0 1 1 1973 1 0 1 1 0 0 0 1 1 1 0 0 1 0 0 0 1 0 1 0 0 1 1 0 0 0 0 126.80 20.0 64.75 0.00 2.04 0.00 2.75 0.00 0.00 2.03 0
526904 1 51 22 3572.31 0 0 2 2 1981 2 1 2 2 2 0 0 1 0 0 0 0 0 2 0 0 0 0 2 0 1 2 0 0 2 0 0 162.38 0.0 1.99 0.00 -6.95 0.00 0.85 0.00 0.00 0.00 0

Cluster 0 contains customers with medium low monetary values and are in the process of churning. We can name this cluster as "Silver"

Cluster 1:

In [43]:
profile2 = pandas_profiling.ProfileReport(df[df['Customer_Cluster']== 1])
profile2
Out[43]:

Overview

Dataset info

Number of variables 48
Number of observations 9187
Total Missing (%) 0.0%
Total size in memory 3.3 MiB
Average record size in memory 380.0 B

Variables types

Numeric 44
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 3
Unsupported 0

Warnings

Variables

ATT_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.83259
Minimum 0
Maximum 9
Zeros (%) 66.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 9
Range 9
Interquartile range 1

Descriptive statistics

Standard deviation 1.3827
Coef of variation 1.6607
Kurtosis 1.9383
Mean 0.83259
MAD 1.1009
Skewness 1.639
Sum 7649
Variance 1.9119
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6074 66.1%
 
2 898 9.8%
 
1 870 9.5%
 
3 664 7.2%
 
4 485 5.3%
 
5 145 1.6%
 
6 36 0.4%
 
7 11 0.1%
 
8 3 0.0%
 
9 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6074 66.1%
 
1 870 9.5%
 
2 898 9.8%
 
3 664 7.2%
 
4 485 5.3%
 

Maximum 5 values

Value Count Frequency (%)  
5 145 1.6%
 
6 36 0.4%
 
7 11 0.1%
 
8 3 0.0%
 
9 1 0.0%
 

AndroidDev_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.2865
Minimum 0
Maximum 11
Zeros (%) 41.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 2
95-th percentile 4
Maximum 11
Range 11
Interquartile range 2

Descriptive statistics

Standard deviation 1.4425
Coef of variation 1.1213
Kurtosis 1.0588
Mean 1.2865
MAD 1.1912
Skewness 1.1159
Sum 11819
Variance 2.0808
Memory size 71.9 KiB
Value Count Frequency (%)  
0 3838 41.8%
 
1 1865 20.3%
 
2 1698 18.5%
 
3 989 10.8%
 
4 522 5.7%
 
5 185 2.0%
 
6 67 0.7%
 
7 14 0.2%
 
8 6 0.1%
 
11 1 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 3838 41.8%
 
1 1865 20.3%
 
2 1698 18.5%
 
3 989 10.8%
 
4 522 5.7%
 

Maximum 5 values

Value Count Frequency (%)  
7 14 0.2%
 
8 6 0.1%
 
9 1 0.0%
 
10 1 0.0%
 
11 1 0.0%
 

BasicDev_cnt
Numeric

Distinct count 6
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.063133
Minimum 0
Maximum 5
Zeros (%) 94.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 5
Range 5
Interquartile range 0

Descriptive statistics

Standard deviation 0.29165
Coef of variation 4.6197
Kurtosis 43.247
Mean 0.063133
MAD 0.11963
Skewness 5.757
Sum 580
Variance 0.085062
Memory size 71.9 KiB
Value Count Frequency (%)  
0 8704 94.7%
 
1 403 4.4%
 
2 67 0.7%
 
3 10 0.1%
 
4 2 0.0%
 
5 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8704 94.7%
 
1 403 4.4%
 
2 67 0.7%
 
3 10 0.1%
 
4 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 403 4.4%
 
2 67 0.7%
 
3 10 0.1%
 
4 2 0.0%
 
5 1 0.0%
 

BeyondUnlmtd_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.75052
Minimum 0
Maximum 11
Zeros (%) 82.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 5
Maximum 11
Range 11
Interquartile range 0

Descriptive statistics

Standard deviation 1.7499
Coef of variation 2.3316
Kurtosis 4.3975
Mean 0.75052
MAD 1.2362
Skewness 2.2849
Sum 6895
Variance 3.0623
Memory size 71.9 KiB
Value Count Frequency (%)  
0 7566 82.4%
 
4 449 4.9%
 
5 349 3.8%
 
3 330 3.6%
 
6 172 1.9%
 
2 172 1.9%
 
7 80 0.9%
 
1 23 0.3%
 
8 21 0.2%
 
9 16 0.2%
 
Other values (2) 9 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 7566 82.4%
 
1 23 0.3%
 
2 172 1.9%
 
3 330 3.6%
 
4 449 4.9%
 

Maximum 5 values

Value Count Frequency (%)  
7 80 0.9%
 
8 21 0.2%
 
9 16 0.2%
 
10 6 0.1%
 
11 3 0.0%
 

CA_state_cd
Numeric

Distinct count 15
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.1836
Minimum 0
Maximum 14
Zeros (%) 52.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 4
95-th percentile 7
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.5767
Coef of variation 1.18
Kurtosis -0.42308
Mean 2.1836
MAD 2.3215
Skewness 0.7601
Sum 20061
Variance 6.6391
Memory size 71.9 KiB
Value Count Frequency (%)  
0 4856 52.9%
 
4 1220 13.3%
 
5 1001 10.9%
 
3 744 8.1%
 
6 554 6.0%
 
2 294 3.2%
 
7 283 3.1%
 
8 127 1.4%
 
9 41 0.4%
 
10 36 0.4%
 
Other values (5) 31 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
0 4856 52.9%
 
1 5 0.1%
 
2 294 3.2%
 
3 744 8.1%
 
4 1220 13.3%
 

Maximum 5 values

Value Count Frequency (%)  
10 36 0.4%
 
11 18 0.2%
 
12 4 0.0%
 
13 1 0.0%
 
14 3 0.0%
 

CommunicStore_chnl_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.2117
Minimum 0
Maximum 14
Zeros (%) 34.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 2
Q3 4
95-th percentile 6
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.126
Coef of variation 0.96124
Kurtosis -0.2623
Mean 2.2117
MAD 1.8264
Skewness 0.6411
Sum 20319
Variance 4.5198
Memory size 71.9 KiB
Value Count Frequency (%)  
0 3201 34.8%
 
4 1281 13.9%
 
3 1273 13.9%
 
2 1093 11.9%
 
1 890 9.7%
 
5 819 8.9%
 
6 368 4.0%
 
7 166 1.8%
 
8 63 0.7%
 
10 13 0.1%
 
Other values (3) 20 0.2%
 

Minimum 5 values

Value Count Frequency (%)  
0 3201 34.8%
 
1 890 9.7%
 
2 1093 11.9%
 
3 1273 13.9%
 
4 1281 13.9%
 

Maximum 5 values

Value Count Frequency (%)  
8 63 0.7%
 
9 11 0.1%
 
10 13 0.1%
 
11 8 0.1%
 
14 1 0.0%
 

Customer_Cluster
Constant

This variable is constant and should be ignored for analysis

Constant value 1

DataOnly_Revtype_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.70905
Minimum 0
Maximum 14
Zeros (%) 52.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 14
Range 14
Interquartile range 1

Descriptive statistics

Standard deviation 0.9594
Coef of variation 1.3531
Kurtosis 9.6919
Mean 0.70905
MAD 0.75018
Skewness 2.0909
Sum 6514
Variance 0.92045
Memory size 71.9 KiB
Value Count Frequency (%)  
0 4860 52.9%
 
1 2805 30.5%
 
2 1085 11.8%
 
3 298 3.2%
 
4 93 1.0%
 
5 29 0.3%
 
6 7 0.1%
 
7 4 0.0%
 
9 2 0.0%
 
8 2 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 4860 52.9%
 
1 2805 30.5%
 
2 1085 11.8%
 
3 298 3.2%
 
4 93 1.0%
 

Maximum 5 values

Value Count Frequency (%)  
7 4 0.0%
 
8 2 0.0%
 
9 2 0.0%
 
10 1 0.0%
 
14 1 0.0%
 

Edge_contract_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.3788
Minimum 0
Maximum 13
Zeros (%) 0.9%

Quantile statistics

Minimum 0
5-th percentile 2
Q1 2
Median 3
Q3 4
95-th percentile 6
Maximum 13
Range 13
Interquartile range 2

Descriptive statistics

Standard deviation 1.2684
Coef of variation 0.37539
Kurtosis 1.4331
Mean 3.3788
MAD 1.0137
Skewness 0.58013
Sum 31041
Variance 1.6087
Memory size 71.9 KiB
Value Count Frequency (%)  
3 2838 30.9%
 
4 2536 27.6%
 
2 2035 22.2%
 
5 1025 11.2%
 
6 325 3.5%
 
1 211 2.3%
 
7 85 0.9%
 
0 81 0.9%
 
8 35 0.4%
 
9 11 0.1%
 
Other values (2) 5 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 81 0.9%
 
1 211 2.3%
 
2 2035 22.2%
 
3 2838 30.9%
 
4 2536 27.6%
 

Maximum 5 values

Value Count Frequency (%)  
7 85 0.9%
 
8 35 0.4%
 
9 11 0.1%
 
10 4 0.0%
 
13 1 0.0%
 

FamTeens_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.5643
Minimum 0
Maximum 14
Zeros (%) 62.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 3
95-th percentile 6
Maximum 14
Range 14
Interquartile range 3

Descriptive statistics

Standard deviation 2.2322
Coef of variation 1.427
Kurtosis 0.16323
Mean 1.5643
MAD 1.962
Skewness 1.1129
Sum 14371
Variance 4.9827
Memory size 71.9 KiB
Value Count Frequency (%)  
0 5743 62.5%
 
4 1022 11.1%
 
3 716 7.8%
 
5 685 7.5%
 
2 395 4.3%
 
6 323 3.5%
 
7 153 1.7%
 
8 57 0.6%
 
1 51 0.6%
 
9 24 0.3%
 
Other values (4) 18 0.2%
 

Minimum 5 values

Value Count Frequency (%)  
0 5743 62.5%
 
1 51 0.6%
 
2 395 4.3%
 
3 716 7.8%
 
4 1022 11.1%
 

Maximum 5 values

Value Count Frequency (%)  
9 24 0.3%
 
10 14 0.2%
 
11 2 0.0%
 
12 1 0.0%
 
14 1 0.0%
 

GoUnlimit_plan_cnt
Numeric

Distinct count 15
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.4868
Minimum 0
Maximum 16
Zeros (%) 65.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 3
95-th percentile 6
Maximum 16
Range 16
Interquartile range 3

Descriptive statistics

Standard deviation 2.2902
Coef of variation 1.5404
Kurtosis 0.66921
Mean 1.4868
MAD 1.9677
Skewness 1.2882
Sum 13659
Variance 5.2448
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6038 65.7%
 
4 921 10.0%
 
5 643 7.0%
 
3 497 5.4%
 
6 368 4.0%
 
2 290 3.2%
 
7 172 1.9%
 
1 126 1.4%
 
8 82 0.9%
 
9 27 0.3%
 
Other values (5) 23 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
0 6038 65.7%
 
1 126 1.4%
 
2 290 3.2%
 
3 497 5.4%
 
4 921 10.0%
 

Maximum 5 values

Value Count Frequency (%)  
10 12 0.1%
 
11 5 0.1%
 
12 4 0.0%
 
13 1 0.0%
 
16 1 0.0%
 

Hispanic_ethnicity_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.6923
Minimum 0
Maximum 14
Zeros (%) 62.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 4
95-th percentile 6
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.3817
Coef of variation 1.4074
Kurtosis 0.10114
Mean 1.6923
MAD 2.1068
Skewness 1.0787
Sum 15547
Variance 5.6727
Memory size 71.9 KiB
Value Count Frequency (%)  
0 5713 62.2%
 
4 1022 11.1%
 
5 778 8.5%
 
3 648 7.1%
 
6 399 4.3%
 
2 269 2.9%
 
7 191 2.1%
 
8 91 1.0%
 
9 28 0.3%
 
10 18 0.2%
 
Other values (4) 30 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
0 5713 62.2%
 
1 14 0.2%
 
2 269 2.9%
 
3 648 7.1%
 
4 1022 11.1%
 

Maximum 5 values

Value Count Frequency (%)  
9 28 0.3%
 
10 18 0.2%
 
11 12 0.1%
 
12 3 0.0%
 
14 1 0.0%
 

IphoneDev_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.5138
Minimum 0
Maximum 14
Zeros (%) 12.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 2
Q3 4
95-th percentile 5
Maximum 14
Range 14
Interquartile range 3

Descriptive statistics

Standard deviation 1.6074
Coef of variation 0.63945
Kurtosis 0.80458
Mean 2.5138
MAD 1.2962
Skewness 0.50428
Sum 23094
Variance 2.5838
Memory size 71.9 KiB
Value Count Frequency (%)  
2 2387 26.0%
 
3 2021 22.0%
 
4 1531 16.7%
 
1 1245 13.6%
 
0 1131 12.3%
 
5 551 6.0%
 
6 197 2.1%
 
7 73 0.8%
 
8 32 0.3%
 
9 13 0.1%
 
Other values (3) 6 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 1131 12.3%
 
1 1245 13.6%
 
2 2387 26.0%
 
3 2021 22.0%
 
4 1531 16.7%
 

Maximum 5 values

Value Count Frequency (%)  
8 32 0.3%
 
9 13 0.1%
 
10 4 0.0%
 
13 1 0.0%
 
14 1 0.0%
 

LocalRetailer_chnl_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.82682
Minimum 0
Maximum 11
Zeros (%) 71.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 1.5868
Coef of variation 1.9191
Kurtosis 4.0787
Mean 0.82682
MAD 1.1795
Skewness 2.0839
Sum 7596
Variance 2.5179
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6553 71.3%
 
1 705 7.7%
 
2 544 5.9%
 
3 484 5.3%
 
4 457 5.0%
 
5 264 2.9%
 
6 110 1.2%
 
7 41 0.4%
 
8 15 0.2%
 
9 8 0.1%
 
Other values (2) 6 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 6553 71.3%
 
1 705 7.7%
 
2 544 5.9%
 
3 484 5.3%
 
4 457 5.0%
 

Maximum 5 values

Value Count Frequency (%)  
7 41 0.4%
 
8 15 0.2%
 
9 8 0.1%
 
10 2 0.0%
 
11 4 0.0%
 

MTM_contract_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.6348
Minimum 0
Maximum 13
Zeros (%) 4.4%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 2
Median 2
Q3 4
95-th percentile 5
Maximum 13
Range 13
Interquartile range 2

Descriptive statistics

Standard deviation 1.459
Coef of variation 0.55374
Kurtosis 1.9372
Mean 2.6348
MAD 1.1641
Skewness 0.82121
Sum 24206
Variance 2.1287
Memory size 71.9 KiB
Value Count Frequency (%)  
2 2848 31.0%
 
3 1998 21.7%
 
4 1632 17.8%
 
1 1514 16.5%
 
5 502 5.5%
 
0 404 4.4%
 
6 170 1.9%
 
7 57 0.6%
 
8 35 0.4%
 
9 13 0.1%
 
Other values (4) 14 0.2%
 

Minimum 5 values

Value Count Frequency (%)  
0 404 4.4%
 
1 1514 16.5%
 
2 2848 31.0%
 
3 1998 21.7%
 
4 1632 17.8%
 

Maximum 5 values

Value Count Frequency (%)  
9 13 0.1%
 
10 9 0.1%
 
11 3 0.0%
 
12 1 0.0%
 
13 1 0.0%
 

MatureAdults_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.58136
Minimum 0
Maximum 13
Zeros (%) 86.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 5
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 1.5836
Coef of variation 2.724
Kurtosis 7.8195
Mean 0.58136
MAD 1.0059
Skewness 2.8208
Sum 5341
Variance 2.5079
Memory size 71.9 KiB
Value Count Frequency (%)  
0 7948 86.5%
 
4 376 4.1%
 
3 278 3.0%
 
5 239 2.6%
 
6 141 1.5%
 
2 94 1.0%
 
7 54 0.6%
 
8 21 0.2%
 
1 15 0.2%
 
9 9 0.1%
 
Other values (4) 12 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 7948 86.5%
 
1 15 0.2%
 
2 94 1.0%
 
3 278 3.0%
 
4 376 4.1%
 

Maximum 5 values

Value Count Frequency (%)  
9 9 0.1%
 
10 5 0.1%
 
11 3 0.0%
 
12 3 0.0%
 
13 1 0.0%
 

NewVZPlan_plan_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.70698
Minimum 0
Maximum 13
Zeros (%) 76.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 4
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 1.489
Coef of variation 2.1062
Kurtosis 5.5447
Mean 0.70698
MAD 1.0834
Skewness 2.3142
Sum 6495
Variance 2.2172
Memory size 71.9 KiB
Value Count Frequency (%)  
0 7039 76.6%
 
2 586 6.4%
 
3 494 5.4%
 
4 408 4.4%
 
1 336 3.7%
 
5 186 2.0%
 
6 76 0.8%
 
7 33 0.4%
 
8 16 0.2%
 
9 7 0.1%
 
Other values (3) 6 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 7039 76.6%
 
1 336 3.7%
 
2 586 6.4%
 
3 494 5.4%
 
4 408 4.4%
 

Maximum 5 values

Value Count Frequency (%)  
8 16 0.2%
 
9 7 0.1%
 
10 3 0.0%
 
11 2 0.0%
 
13 1 0.0%
 

Purch_equip_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.98345
Minimum 0
Maximum 14
Zeros (%) 46.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 1
95-th percentile 4
Maximum 14
Range 14
Interquartile range 1

Descriptive statistics

Standard deviation 1.3211
Coef of variation 1.3433
Kurtosis 8.2227
Mean 0.98345
MAD 0.92254
Skewness 2.2373
Sum 9035
Variance 1.7452
Memory size 71.9 KiB
Value Count Frequency (%)  
0 4309 46.9%
 
1 2603 28.3%
 
2 1310 14.3%
 
3 504 5.5%
 
4 249 2.7%
 
5 101 1.1%
 
6 50 0.5%
 
7 29 0.3%
 
8 13 0.1%
 
9 9 0.1%
 
Other values (4) 10 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 4309 46.9%
 
1 2603 28.3%
 
2 1310 14.3%
 
3 504 5.5%
 
4 249 2.7%
 

Maximum 5 values

Value Count Frequency (%)  
9 9 0.1%
 
10 4 0.0%
 
11 3 0.0%
 
12 2 0.0%
 
14 1 0.0%
 

SingleLine_plan_cnt
Numeric

Distinct count 7
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.037227
Minimum 0
Maximum 13
Zeros (%) 97.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 0.26504
Coef of variation 7.1196
Kurtosis 672.31
Mean 0.037227
MAD 0.072265
Skewness 18.034
Sum 342
Variance 0.070245
Memory size 71.9 KiB
Value Count Frequency (%)  
0 8917 97.1%
 
1 225 2.4%
 
2 31 0.3%
 
3 11 0.1%
 
13 1 0.0%
 
5 1 0.0%
 
4 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8917 97.1%
 
1 225 2.4%
 
2 31 0.3%
 
3 11 0.1%
 
4 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2 31 0.3%
 
3 11 0.1%
 
4 1 0.0%
 
5 1 0.0%
 
13 1 0.0%
 

SngleCpleKids_cnt
Numeric

Distinct count 15
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.0078
Minimum 0
Maximum 14
Zeros (%) 53.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 4
95-th percentile 6
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.4079
Coef of variation 1.1992
Kurtosis -0.27574
Mean 2.0078
MAD 2.1461
Skewness 0.81718
Sum 18446
Variance 5.7979
Memory size 71.9 KiB
Value Count Frequency (%)  
0 4888 53.2%
 
4 1224 13.3%
 
3 918 10.0%
 
5 877 9.5%
 
6 437 4.8%
 
2 433 4.7%
 
7 218 2.4%
 
8 80 0.9%
 
1 40 0.4%
 
9 29 0.3%
 
Other values (5) 43 0.5%
 

Minimum 5 values

Value Count Frequency (%)  
0 4888 53.2%
 
1 40 0.4%
 
2 433 4.7%
 
3 918 10.0%
 
4 1224 13.3%
 

Maximum 5 values

Value Count Frequency (%)  
10 25 0.3%
 
11 13 0.1%
 
12 3 0.0%
 
13 1 0.0%
 
14 1 0.0%
 

Sprint_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.63492
Minimum 0
Maximum 10
Zeros (%) 74.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 4
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.2612
Coef of variation 1.9864
Kurtosis 3.4123
Mean 0.63492
MAD 0.94958
Skewness 2.0203
Sum 5833
Variance 1.5906
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6870 74.8%
 
2 713 7.8%
 
1 563 6.1%
 
3 500 5.4%
 
4 405 4.4%
 
5 103 1.1%
 
6 26 0.3%
 
7 5 0.1%
 
10 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6870 74.8%
 
1 563 6.1%
 
2 713 7.8%
 
3 500 5.4%
 
4 405 4.4%
 

Maximum 5 values

Value Count Frequency (%)  
5 103 1.1%
 
6 26 0.3%
 
7 5 0.1%
 
8 1 0.0%
 
10 1 0.0%
 

TX_state_cd
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.4972
Minimum 0
Maximum 14
Zeros (%) 65.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 3
95-th percentile 6
Maximum 14
Range 14
Interquartile range 3

Descriptive statistics

Standard deviation 2.2544
Coef of variation 1.5057
Kurtosis 0.41746
Mean 1.4972
MAD 1.9614
Skewness 1.2174
Sum 13755
Variance 5.0822
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6016 65.5%
 
4 925 10.1%
 
3 694 7.6%
 
5 663 7.2%
 
6 321 3.5%
 
2 282 3.1%
 
7 167 1.8%
 
8 63 0.7%
 
9 29 0.3%
 
10 18 0.2%
 
Other values (4) 9 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 6016 65.5%
 
1 5 0.1%
 
2 282 3.1%
 
3 694 7.6%
 
4 925 10.1%
 

Maximum 5 values

Value Count Frequency (%)  
9 29 0.3%
 
10 18 0.2%
 
11 1 0.0%
 
12 2 0.0%
 
14 1 0.0%
 

T_Mobile_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.53206
Minimum 0
Maximum 9
Zeros (%) 78.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 3
Maximum 9
Range 9
Interquartile range 0

Descriptive statistics

Standard deviation 1.1834
Coef of variation 2.2243
Kurtosis 5.8383
Mean 0.53206
MAD 0.83246
Skewness 2.4421
Sum 4888
Variance 1.4005
Memory size 71.9 KiB
Value Count Frequency (%)  
0 7187 78.2%
 
1 589 6.4%
 
2 576 6.3%
 
3 383 4.2%
 
4 321 3.5%
 
5 94 1.0%
 
6 24 0.3%
 
7 6 0.1%
 
8 5 0.1%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 7187 78.2%
 
1 589 6.4%
 
2 576 6.3%
 
3 383 4.2%
 
4 321 3.5%
 

Maximum 5 values

Value Count Frequency (%)  
5 94 1.0%
 
6 24 0.3%
 
7 6 0.1%
 
8 5 0.1%
 
9 2 0.0%
 

TwoYear_contract_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.47056
Minimum 0
Maximum 14
Zeros (%) 67.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 14
Range 14
Interquartile range 1

Descriptive statistics

Standard deviation 0.82543
Coef of variation 1.7542
Kurtosis 16.554
Mean 0.47056
MAD 0.63287
Skewness 2.7698
Sum 4323
Variance 0.68134
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6178 67.2%
 
1 2099 22.8%
 
2 639 7.0%
 
3 191 2.1%
 
4 54 0.6%
 
5 15 0.2%
 
6 6 0.1%
 
7 3 0.0%
 
14 1 0.0%
 
11 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6178 67.2%
 
1 2099 22.8%
 
2 639 7.0%
 
3 191 2.1%
 
4 54 0.6%
 

Maximum 5 values

Value Count Frequency (%)  
5 15 0.2%
 
6 6 0.1%
 
7 3 0.0%
 
11 1 0.0%
 
14 1 0.0%
 

VerizonUNL_plan_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.5774
Minimum 0
Maximum 14
Zeros (%) 35.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 3
Q3 4
95-th percentile 6
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.3543
Coef of variation 0.91342
Kurtosis -0.45352
Mean 2.5774
MAD 2.0311
Skewness 0.47325
Sum 23679
Variance 5.5427
Memory size 71.9 KiB
Value Count Frequency (%)  
0 3277 35.7%
 
4 1524 16.6%
 
3 1340 14.6%
 
5 1031 11.2%
 
2 776 8.4%
 
6 521 5.7%
 
1 276 3.0%
 
7 247 2.7%
 
8 105 1.1%
 
10 42 0.5%
 
Other values (4) 48 0.5%
 

Minimum 5 values

Value Count Frequency (%)  
0 3277 35.7%
 
1 276 3.0%
 
2 776 8.4%
 
3 1340 14.6%
 
4 1524 16.6%
 

Maximum 5 values

Value Count Frequency (%)  
9 34 0.4%
 
10 42 0.5%
 
11 11 0.1%
 
12 2 0.0%
 
14 1 0.0%
 

VoiceOnly_pplangrp_cnt
Highly correlated

This variable is highly correlated with SingleLine_plan_cnt and should be ignored for analysis

Correlation 0.92249

WearablDev_cnt
Numeric

Distinct count 8
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.34679
Minimum 0
Maximum 8
Zeros (%) 75.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 8
Range 8
Interquartile range 1

Descriptive statistics

Standard deviation 0.68729
Coef of variation 1.9818
Kurtosis 7.9117
Mean 0.34679
MAD 0.51995
Skewness 2.4023
Sum 3186
Variance 0.47236
Memory size 71.9 KiB
Value Count Frequency (%)  
0 6887 75.0%
 
1 1589 17.3%
 
2 579 6.3%
 
3 104 1.1%
 
4 20 0.2%
 
6 4 0.0%
 
5 3 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6887 75.0%
 
1 1589 17.3%
 
2 579 6.3%
 
3 104 1.1%
 
4 20 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
3 104 1.1%
 
4 20 0.2%
 
5 3 0.0%
 
6 4 0.0%
 
8 1 0.0%
 

access_rev_acrtn
Numeric

Distinct count 1457
Unique (%) 15.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -0.9712
Minimum -56.78
Maximum 21.38
Zeros (%) 31.7%

Quantile statistics

Minimum -56.78
5-th percentile -5.52
Q1 -2.38
Median -0.67
Q3 0
95-th percentile 4.11
Maximum 21.38
Range 78.16
Interquartile range 2.38

Descriptive statistics

Standard deviation 3.209
Coef of variation -3.3041
Kurtosis 22.336
Mean -0.9712
MAD 2.0205
Skewness -0.68663
Sum -8922.4
Variance 10.297
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 2913 31.7%
 
-1.26 79 0.9%
 
-1.22 60 0.7%
 
-1.47 58 0.6%
 
-1.9 56 0.6%
 
-1.31 56 0.6%
 
-1.36 50 0.5%
 
-2.5 46 0.5%
 
-1.83 45 0.5%
 
-0.98 42 0.5%
 
Other values (1447) 5782 62.9%
 

Minimum 5 values

Value Count Frequency (%)  
-56.78 1 0.0%
 
-50.32 1 0.0%
 
-27.5 1 0.0%
 
-27.38 1 0.0%
 
-25.29 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
16.44 1 0.0%
 
17.58 1 0.0%
 
19.41 1 0.0%
 
20.05 1 0.0%
 
21.38 1 0.0%
 

active_loan_cnt
Numeric

Distinct count 58
Unique (%) 0.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 15.867
Minimum 0
Maximum 169
Zeros (%) 0.8%

Quantile statistics

Minimum 0
5-th percentile 4
Q1 9
Median 15
Q3 20
95-th percentile 36
Maximum 169
Range 169
Interquartile range 11

Descriptive statistics

Standard deviation 11.052
Coef of variation 0.69655
Kurtosis 12.674
Mean 15.867
MAD 7.7759
Skewness 2.3511
Sum 145770
Variance 122.15
Memory size 71.9 KiB
Value Count Frequency (%)  
16 1337 14.6%
 
9 1225 13.3%
 
12 1013 11.0%
 
4 780 8.5%
 
20 704 7.7%
 
6 647 7.0%
 
25 542 5.9%
 
15 427 4.6%
 
8 397 4.3%
 
24 335 3.6%
 
Other values (48) 1780 19.4%
 

Minimum 5 values

Value Count Frequency (%)  
0 78 0.8%
 
1 8 0.1%
 
2 32 0.3%
 
3 34 0.4%
 
4 780 8.5%
 

Maximum 5 values

Value Count Frequency (%)  
100 1 0.0%
 
108 1 0.0%
 
110 1 0.0%
 
140 1 0.0%
 
169 1 0.0%
 

arpu_amt
Numeric

Distinct count 7258
Unique (%) 79.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 307.39
Minimum 135.3
Maximum 3843.4
Zeros (%) 0.0%

Quantile statistics

Minimum 135.3
5-th percentile 228.34
Q1 263.13
Median 300.01
Q3 338.42
95-th percentile 409.79
Maximum 3843.4
Range 3708.1
Interquartile range 75.285

Descriptive statistics

Standard deviation 87.46
Coef of variation 0.28452
Kurtosis 601.45
Mean 307.39
MAD 46.375
Skewness 18.364
Sum 2824000
Variance 7649.2
Memory size 71.9 KiB
Value Count Frequency (%)  
298.34 5 0.1%
 
314.79 5 0.1%
 
316.96 5 0.1%
 
256.54 5 0.1%
 
321.64 5 0.1%
 
296.94 5 0.1%
 
326.59 4 0.0%
 
286.34 4 0.0%
 
283.3 4 0.0%
 
292.49 4 0.0%
 
Other values (7248) 9141 99.5%
 

Minimum 5 values

Value Count Frequency (%)  
135.3 1 0.0%
 
164.97 1 0.0%
 
166.14 1 0.0%
 
177.53 1 0.0%
 
194.34 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2122.74 1 0.0%
 
2182.79 1 0.0%
 
3159.1 1 0.0%
 
3167.02 1 0.0%
 
3843.4 1 0.0%
 

billed_acct_size
Numeric

Distinct count 16
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4.3669
Minimum 0
Maximum 16
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 2
Q1 3
Median 4
Q3 5
95-th percentile 7
Maximum 16
Range 16
Interquartile range 2

Descriptive statistics

Standard deviation 1.5453
Coef of variation 0.35387
Kurtosis 2.0147
Mean 4.3669
MAD 1.2024
Skewness 0.94162
Sum 40119
Variance 2.388
Memory size 71.9 KiB
Value Count Frequency (%)  
4 2731 29.7%
 
5 1963 21.4%
 
3 1902 20.7%
 
6 999 10.9%
 
2 787 8.6%
 
7 468 5.1%
 
8 183 2.0%
 
9 60 0.7%
 
10 55 0.6%
 
1 16 0.2%
 
Other values (6) 23 0.3%
 

Minimum 5 values

Value Count Frequency (%)  
0 1 0.0%
 
1 16 0.2%
 
2 787 8.6%
 
3 1902 20.7%
 
4 2731 29.7%
 

Maximum 5 values

Value Count Frequency (%)  
11 14 0.2%
 
12 3 0.0%
 
13 2 0.0%
 
14 2 0.0%
 
16 1 0.0%
 

birth_year
Numeric

Distinct count 74
Unique (%) 0.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1977
Minimum 0
Maximum 1999
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1954
Q1 1968
Median 1979
Q3 1988
95-th percentile 1996
Maximum 1999
Range 1999
Interquartile range 20

Descriptive statistics

Standard deviation 24.435
Coef of variation 0.01236
Kurtosis 4664.9
Mean 1977
MAD 11.058
Skewness -57.739
Sum 18162331
Variance 597.07
Memory size 71.9 KiB
Value Count Frequency (%)  
1989 267 2.9%
 
1990 267 2.9%
 
1983 263 2.9%
 
1987 262 2.9%
 
1984 261 2.8%
 
1988 260 2.8%
 
1986 257 2.8%
 
1991 255 2.8%
 
1982 249 2.7%
 
1980 248 2.7%
 
Other values (64) 6598 71.8%
 

Minimum 5 values

Value Count Frequency (%)  
0 1 0.0%
 
1919 3 0.0%
 
1920 2 0.0%
 
1924 1 0.0%
 
1926 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1995 197 2.1%
 
1996 147 1.6%
 
1997 141 1.5%
 
1998 106 1.2%
 
1999 75 0.8%
 

cust_status_ind
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.95733
1
8795
0
 
392
Value Count Frequency (%)  
1 8795 95.7%
 
0 392 4.3%
 

data_overage_amt
Numeric

Distinct count 324
Unique (%) 3.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.66383
Minimum -0.03
Maximum 1725
Zeros (%) 90.0%

Quantile statistics

Minimum -0.03
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2.122
Maximum 1725
Range 1725
Interquartile range 0

Descriptive statistics

Standard deviation 18.375
Coef of variation 27.68
Kurtosis 8447.4
Mean 0.66383
MAD 1.2025
Skewness 90.247
Sum 6098.6
Variance 337.64
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 8268 90.0%
 
0.48 38 0.4%
 
0.97 27 0.3%
 
0.5 22 0.2%
 
1.45 16 0.2%
 
0.56 16 0.2%
 
1.0 14 0.2%
 
2.5 14 0.2%
 
0.54 12 0.1%
 
0.52 12 0.1%
 
Other values (314) 748 8.1%
 

Minimum 5 values

Value Count Frequency (%)  
-0.03 1 0.0%
 
0.0 8268 90.0%
 
0.01 2 0.0%
 
0.02 2 0.0%
 
0.03 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
82.86 1 0.0%
 
86.54 1 0.0%
 
115.56 1 0.0%
 
224.55 1 0.0%
 
1725.0 1 0.0%
 

data_ovrg_Yes_cnt
Numeric

Distinct count 5
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.14074
Minimum 0
Maximum 4
Zeros (%) 89.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.46415
Coef of variation 3.2979
Kurtosis 16.109
Mean 0.14074
MAD 0.25293
Skewness 3.8218
Sum 1293
Variance 0.21544
Memory size 71.9 KiB
Value Count Frequency (%)  
0 8255 89.9%
 
1 638 6.9%
 
2 233 2.5%
 
3 55 0.6%
 
4 6 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 8255 89.9%
 
1 638 6.9%
 
2 233 2.5%
 
3 55 0.6%
 
4 6 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
0 8255 89.9%
 
1 638 6.9%
 
2 233 2.5%
 
3 55 0.6%
 
4 6 0.1%
 

eqp_rev_acrtn
Highly correlated

This variable is highly correlated with tot_revenue_acrtn and should be ignored for analysis

Correlation 0.97215

frequency
Numeric

Distinct count 28
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 25.975
Minimum 2
Maximum 54
Zeros (%) 0.0%

Quantile statistics

Minimum 2
5-th percentile 20
Q1 23
Median 26
Q3 29
95-th percentile 31
Maximum 54
Range 52
Interquartile range 6

Descriptive statistics

Standard deviation 3.5577
Coef of variation 0.13697
Kurtosis 1.8598
Mean 25.975
MAD 2.944
Skewness -0.45501
Sum 238633
Variance 12.657
Memory size 71.9 KiB
Value Count Frequency (%)  
30 1092 11.9%
 
29 1041 11.3%
 
28 909 9.9%
 
27 886 9.6%
 
26 797 8.7%
 
25 734 8.0%
 
23 673 7.3%
 
24 626 6.8%
 
31 615 6.7%
 
21 606 6.6%
 
Other values (18) 1208 13.1%
 

Minimum 5 values

Value Count Frequency (%)  
2 3 0.0%
 
4 5 0.1%
 
11 3 0.0%
 
12 4 0.0%
 
13 5 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
46 1 0.0%
 
49 1 0.0%
 
50 1 0.0%
 
51 1 0.0%
 
54 1 0.0%
 

gl_late_fees_amt
Numeric

Distinct count 2772
Unique (%) 30.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 46.485
Minimum -25
Maximum 315.55
Zeros (%) 24.5%

Quantile statistics

Minimum -25
5-th percentile 0
Q1 5
Median 27.83
Q3 90
95-th percentile 130.4
Maximum 315.55
Range 340.55
Interquartile range 85

Descriptive statistics

Standard deviation 47.695
Coef of variation 1.026
Kurtosis -0.86694
Mean 46.485
MAD 42.516
Skewness 0.64025
Sum 427060
Variance 2274.8
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 2249 24.5%
 
5.0 947 10.3%
 
10.0 436 4.7%
 
15.0 246 2.7%
 
20.0 169 1.8%
 
25.0 165 1.8%
 
30.0 155 1.7%
 
35.0 137 1.5%
 
60.0 100 1.1%
 
40.0 100 1.1%
 
Other values (2762) 4483 48.8%
 

Minimum 5 values

Value Count Frequency (%)  
-25.0 1 0.0%
 
-5.0 1 0.0%
 
0.0 2249 24.5%
 
0.01 1 0.0%
 
0.44 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
208.5 1 0.0%
 
220.34 1 0.0%
 
225.14 1 0.0%
 
233.14 1 0.0%
 
315.55 1 0.0%
 

gl_nonrecur_rev_amt
Numeric

Distinct count 6057
Unique (%) 65.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 173.44
Minimum -1168.4
Maximum 6911.1
Zeros (%) 7.6%

Quantile statistics

Minimum -1168.4
5-th percentile -5
Q1 12.965
Median 80.37
Q3 218.91
95-th percentile 709.92
Maximum 6911.1
Range 8079.5
Interquartile range 205.95

Descriptive statistics

Standard deviation 296.34
Coef of variation 1.7086
Kurtosis 53.899
Mean 173.44
MAD 183.11
Skewness 4.7263
Sum 1593400
Variance 87819
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 694 7.6%
 
5.0 252 2.7%
 
10.0 107 1.2%
 
1.99 99 1.1%
 
15.0 94 1.0%
 
25.0 59 0.6%
 
20.0 57 0.6%
 
30.0 51 0.6%
 
35.0 45 0.5%
 
6.99 41 0.4%
 
Other values (6047) 7688 83.7%
 

Minimum 5 values

Value Count Frequency (%)  
-1168.37 1 0.0%
 
-1125.0 1 0.0%
 
-910.42 1 0.0%
 
-722.0 1 0.0%
 
-623.55 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
3321.74 1 0.0%
 
3782.41 1 0.0%
 
4239.76 1 0.0%
 
4769.52 1 0.0%
 
6911.15 1 0.0%
 

handset_paychnl_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.6516
Minimum 0
Maximum 14
Zeros (%) 34.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 3
Q3 4
95-th percentile 6
Maximum 14
Range 14
Interquartile range 4

Descriptive statistics

Standard deviation 2.2961
Coef of variation 0.86594
Kurtosis -0.71131
Mean 2.6516
MAD 1.9826
Skewness 0.303
Sum 24360
Variance 5.2721
Memory size 71.9 KiB
Value Count Frequency (%)  
0 3193 34.8%
 
4 1742 19.0%
 
3 1380 15.0%
 
5 1083 11.8%
 
2 793 8.6%
 
6 526 5.7%
 
7 238 2.6%
 
8 90 1.0%
 
1 75 0.8%
 
9 46 0.5%
 
Other values (4) 21 0.2%
 

Minimum 5 values

Value Count Frequency (%)  
0 3193 34.8%
 
1 75 0.8%
 
2 793 8.6%
 
3 1380 15.0%
 
4 1742 19.0%
 

Maximum 5 values

Value Count Frequency (%)  
9 46 0.5%
 
10 15 0.2%
 
11 4 0.0%
 
12 1 0.0%
 
14 1 0.0%
 

index
Numeric

Distinct count 9187
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 381690
Minimum 139
Maximum 758123
Zeros (%) 0.0%

Quantile statistics

Minimum 139
5-th percentile 39583
Q1 191720
Median 385210
Q3 575470
95-th percentile 719140
Maximum 758123
Range 757984
Interquartile range 383750

Descriptive statistics

Standard deviation 219240
Coef of variation 0.57439
Kurtosis -1.2072
Mean 381690
MAD 190160
Skewness -0.01511
Sum 3506578398
Variance 48065000000
Memory size 71.9 KiB
Value Count Frequency (%)  
1 644194 7012.0%
 
1 490658 5340.8%
 
1 528789 5755.8%
 
1 18287 199.1%
 
1 270832 2948.0%
 
1 427306 4651.2%
 
1 302377 3291.4%
 
1 54568 594.0%
 
1 277797 3023.8%
 
1 552191 6010.6%
 
Other values (9177) 3503011399 38130090.3%
 

Minimum 5 values

Value Count Frequency (%)  
1 644194 7012.0%
 
1 490658 5340.8%
 
1 528789 5755.8%
 
1 18287 199.1%
 
1 270832 2948.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 187022 2035.7%
 
1 649872 7073.8%
 
1 303761 3306.4%
 
1 78646 856.1%
 
1 163840 1783.4%
 

monetary
Numeric

Distinct count 9061
Unique (%) 98.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 7801.9
Minimum 6318.2
Maximum 9993.6
Zeros (%) 0.0%

Quantile statistics

Minimum 6318.2
5-th percentile 6495.7
Q1 6952.9
Median 7631.9
Q3 8561.6
95-th percentile 9602.4
Maximum 9993.6
Range 3675.4
Interquartile range 1608.8

Descriptive statistics

Standard deviation 984.98
Coef of variation 0.12625
Kurtosis -0.90437
Mean 7801.9
MAD 839.99
Skewness 0.45254
Sum 71676000
Variance 970190
Memory size 71.9 KiB
Value Count Frequency (%)  
6514.28 3 0.0%
 
8100.54 2 0.0%
 
6851.26 2 0.0%
 
6667.05 2 0.0%
 
7538.91 2 0.0%
 
6717.0 2 0.0%
 
7943.44 2 0.0%
 
6907.37 2 0.0%
 
7217.57 2 0.0%
 
7382.89 2 0.0%
 
Other values (9051) 9166 99.8%
 

Minimum 5 values

Value Count Frequency (%)  
6318.21 1 0.0%
 
6334.04 1 0.0%
 
6373.93 1 0.0%
 
6375.98 1 0.0%
 
6385.52 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
9990.51 1 0.0%
 
9991.44 1 0.0%
 
9992.81 1 0.0%
 
9993.3 1 0.0%
 
9993.61 1 0.0%
 

pplan_access_acrtn
Numeric

Distinct count 576
Unique (%) 6.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 10.436
Minimum -2.92
Maximum 440
Zeros (%) 35.0%

Quantile statistics

Minimum -2.92
5-th percentile 0
Q1 0
Median 8.46
Q3 17.6
95-th percentile 29.33
Maximum 440
Range 442.92
Interquartile range 17.6

Descriptive statistics

Standard deviation 12.204
Coef of variation 1.1693
Kurtosis 178.01
Mean 10.436
MAD 9.1759
Skewness 6.2669
Sum 95879
Variance 148.93
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 3213 35.0%
 
18.33 242 2.6%
 
11.38 191 2.1%
 
14.67 178 1.9%
 
15.71 176 1.9%
 
22.0 174 1.9%
 
15.17 173 1.9%
 
16.3 169 1.8%
 
16.92 166 1.8%
 
11.0 161 1.8%
 
Other values (566) 4344 47.3%
 

Minimum 5 values

Value Count Frequency (%)  
-2.92 1 0.0%
 
-2.71 1 0.0%
 
-2.5 1 0.0%
 
-2.1 1 0.0%
 
-1.75 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
86.43 2 0.0%
 
96.25 1 0.0%
 
101.54 1 0.0%
 
220.0 1 0.0%
 
440.0 1 0.0%
 

recency
Numeric

Distinct count 23
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 59.43
Minimum 51
Maximum 842
Zeros (%) 0.0%

Quantile statistics

Minimum 51
5-th percentile 51
Q1 51
Median 51
Q3 51
95-th percentile 81
Maximum 842
Range 791
Interquartile range 0

Descriptive statistics

Standard deviation 45.449
Coef of variation 0.76474
Kurtosis 75.871
Mean 59.43
MAD 15.938
Skewness 7.6664
Sum 545988
Variance 2065.6
Memory size 71.9 KiB
Value Count Frequency (%)  
51 8684 94.5%
 
81 90 1.0%
 
173 66 0.7%
 
112 63 0.7%
 
142 58 0.6%
 
232 48 0.5%
 
201 45 0.5%
 
263 31 0.3%
 
293 23 0.3%
 
354 18 0.2%
 
Other values (13) 61 0.7%
 

Minimum 5 values

Value Count Frequency (%)  
51 8684 94.5%
 
81 90 1.0%
 
112 63 0.7%
 
142 58 0.6%
 
173 66 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
597 1 0.0%
 
628 2 0.0%
 
658 4 0.0%
 
811 1 0.0%
 
842 2 0.0%
 

recurring_paychnl_cnt
Numeric

Distinct count 14
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.1457
Minimum 0
Maximum 13
Zeros (%) 26.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 3
Q3 5
95-th percentile 7
Maximum 13
Range 13
Interquartile range 5

Descriptive statistics

Standard deviation 2.3458
Coef of variation 0.7457
Kurtosis -0.46865
Mean 3.1457
MAD 1.9257
Skewness 0.14833
Sum 28900
Variance 5.5027
Memory size 71.9 KiB
Value Count Frequency (%)  
0 2466 26.8%
 
4 1923 20.9%
 
3 1439 15.7%
 
5 1361 14.8%
 
6 685 7.5%
 
2 673 7.3%
 
7 340 3.7%
 
8 140 1.5%
 
1 50 0.5%
 
9 48 0.5%
 
Other values (4) 62 0.7%
 

Minimum 5 values

Value Count Frequency (%)  
0 2466 26.8%
 
1 50 0.5%
 
2 673 7.3%
 
3 1439 15.7%
 
4 1923 20.9%
 

Maximum 5 values

Value Count Frequency (%)  
9 48 0.5%
 
10 42 0.5%
 
11 13 0.1%
 
12 6 0.1%
 
13 1 0.0%
 

tot_revenue_acrtn
Numeric

Distinct count 3241
Unique (%) 35.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -10.763
Minimum -3165.3
Maximum 1923
Zeros (%) 0.1%

Quantile statistics

Minimum -3165.3
5-th percentile -30.37
Q1 -14.31
Median -8.68
Q3 -4.66
95-th percentile 0.827
Maximum 1923
Range 5088.4
Interquartile range 9.65

Descriptive statistics

Standard deviation 42.119
Coef of variation -3.9135
Kurtosis 3957.4
Mean -10.763
MAD 8.0933
Skewness -33.286
Sum -98875
Variance 1774
Memory size 71.9 KiB
Value Count Frequency (%)  
-7.07 16 0.2%
 
-7.33 14 0.2%
 
-5.74 13 0.1%
 
-6.41 13 0.1%
 
-5.99 12 0.1%
 
-6.42 12 0.1%
 
-7.56 12 0.1%
 
-7.54 12 0.1%
 
-8.3 12 0.1%
 
-9.19 12 0.1%
 
Other values (3231) 9059 98.6%
 

Minimum 5 values

Value Count Frequency (%)  
-3165.34 1 0.0%
 
-209.03 1 0.0%
 
-177.9 1 0.0%
 
-141.41 1 0.0%
 
-120.58 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
75.41 1 0.0%
 
99.97 1 0.0%
 
473.63 1 0.0%
 
1065.21 1 0.0%
 
1923.02 1 0.0%
 

usage_rev_acrtn
Numeric

Distinct count 424
Unique (%) 4.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.065239
Minimum -32.37
Maximum 111.1
Zeros (%) 84.3%

Quantile statistics

Minimum -32.37
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.41
Maximum 111.1
Range 143.47
Interquartile range 0

Descriptive statistics

Standard deviation 1.7148
Coef of variation 26.286
Kurtosis 2182.8
Mean 0.065239
MAD 0.23287
Skewness 35.313
Sum 599.35
Variance 2.9407
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 7749 84.3%
 
0.09 69 0.8%
 
0.07 59 0.6%
 
0.08 48 0.5%
 
0.1 31 0.3%
 
0.17 25 0.3%
 
-0.07 24 0.3%
 
0.34 22 0.2%
 
0.32 22 0.2%
 
0.14 21 0.2%
 
Other values (414) 1117 12.2%
 

Minimum 5 values

Value Count Frequency (%)  
-32.37 1 0.0%
 
-30.91 1 0.0%
 
-18.37 1 0.0%
 
-17.82 1 0.0%
 
-12.35 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
24.79 1 0.0%
 
31.15 1 0.0%
 
32.94 1 0.0%
 
63.68 1 0.0%
 
111.1 1 0.0%
 

voice_overage_amt
Numeric

Distinct count 500
Unique (%) 5.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.50609
Minimum 0
Maximum 393.05
Zeros (%) 84.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1.79
Maximum 393.05
Range 393.05
Interquartile range 0

Descriptive statistics

Standard deviation 5.028
Coef of variation 9.9348
Kurtosis 4088.1
Mean 0.50609
MAD 0.89301
Skewness 54.863
Sum 4649.5
Variance 25.28
Memory size 71.9 KiB
Value Count Frequency (%)  
0.0 7784 84.7%
 
0.06 47 0.5%
 
0.07 40 0.4%
 
0.12 30 0.3%
 
0.21 22 0.2%
 
0.08 20 0.2%
 
0.09 20 0.2%
 
0.14 17 0.2%
 
0.19 17 0.2%
 
0.18 16 0.2%
 
Other values (490) 1174 12.8%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 7784 84.7%
 
0.01 1 0.0%
 
0.02 2 0.0%
 
0.03 4 0.0%
 
0.04 7 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
54.57 1 0.0%
 
63.68 1 0.0%
 
66.36 1 0.0%
 
113.85 1 0.0%
 
393.05 1 0.0%
 

Correlations

Sample

cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
13719 1 51 28 6943.23 0 4 4 8 1975 0 0 3 0 0 0 0 2 0 2 0 0 4 0 0 0 0 0 4 4 2 4 0 0 0 0 0 247.97 35.00 35.00 15.71 -0.92 0.00 1.49 0.0 0.0 0.00 1
737737 1 51 24 7281.24 0 4 4 16 1980 0 0 2 3 0 0 0 4 4 2 0 0 4 0 0 0 4 0 4 0 0 4 0 0 0 0 0 303.38 91.06 456.92 18.33 -5.99 0.00 0.10 0.0 0.0 0.00 1
592879 1 51 23 7871.23 0 0 9 9 1965 0 9 5 9 9 0 1 1 0 0 4 0 0 5 0 0 0 0 0 0 8 8 0 0 0 0 0 342.23 0.00 219.63 0.00 -10.99 0.00 0.00 0.0 0.0 0.31 1
153277 1 51 22 8663.82 0 0 4 12 1998 0 4 2 0 4 0 0 3 0 2 0 0 0 1 1 0 4 0 0 4 1 2 1 0 4 1 0 393.81 95.00 495.00 0.00 -4.10 0.00 0.00 0.0 0.0 0.00 1
627818 1 51 26 7183.53 2 4 4 8 1989 0 4 2 4 0 0 0 2 0 0 2 0 4 0 2 0 4 0 4 4 2 2 0 0 0 0 0 276.29 0.00 85.65 0.77 1.04 1.38 0.48 0.0 0.0 0.00 1

Cluster 1 - Platinum Findings:

1. Recency - High (Mean - 58.759)
2. Frequency - High (Mean - 25.9)
3. Monetary - High (Mean - 7790)
6. IphoneDev_cnt - 87.2%
10. ARPU Amount - 305.87 (Mean)
11. billed_acct_size - 4.36 (Mean)
12. Birth Year - 1977 (Mean)
13. % of AT&T port in - 33.5%
14. % of Sprint_port in - 25.8%
15. % of T mobile port in - 22.7%
16. % of Android Devices - 58.8%
17. CA users - 46.8%
18. TX users - 35%
19. CommunicStore_chnl_cnt - 65.1%
20. Edge_contract_cnt - 99.2% (Mean - 3 lines per customer)
22. SingleLine_plan_cnt - 2.9%
23. data_ovrg_Yes_cnt - 11%
25. Hispanic Ethnicity - 38.1%
26. MTM_contract_cnt - 95.7%
27. Purch_equip_cnt - 43.5%
28. VerizonUNL_plan_cnt - 64.3%
29. WearablDev_cnt - 24.8%
In [44]:
df[df['Customer_Cluster']== 1].head(10)
Out[44]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
13719 1 51 28 6943.23 0 4 4 8 1975 0 0 3 0 0 0 0 2 0 2 0 0 4 0 0 0 0 0 4 4 2 4 0 0 0 0 0 247.97 35.00 35.00 15.71 -0.92 0.00 1.49 0.0 0.00 0.00 1
737737 1 51 24 7281.24 0 4 4 16 1980 0 0 2 3 0 0 0 4 4 2 0 0 4 0 0 0 4 0 4 0 0 4 0 0 0 0 0 303.38 91.06 456.92 18.33 -5.99 0.00 0.10 0.0 0.00 0.00 1
592879 1 51 23 7871.23 0 0 9 9 1965 0 9 5 9 9 0 1 1 0 0 4 0 0 5 0 0 0 0 0 0 8 8 0 0 0 0 0 342.23 0.00 219.63 0.00 -10.99 0.00 0.00 0.0 0.00 0.31 1
153277 1 51 22 8663.82 0 0 4 12 1998 0 4 2 0 4 0 0 3 0 2 0 0 0 1 1 0 4 0 0 4 1 2 1 0 4 1 0 393.81 95.00 495.00 0.00 -4.10 0.00 0.00 0.0 0.00 0.00 1
627818 1 51 26 7183.53 2 4 4 8 1989 0 4 2 4 0 0 0 2 0 0 2 0 4 0 2 0 4 0 4 4 2 2 0 0 0 0 0 276.29 0.00 85.65 0.77 1.04 1.38 0.48 0.0 0.00 0.00 1
573522 1 51 21 9441.34 0 0 8 42 1985 7 8 5 8 8 8 1 6 0 1 1 0 0 0 1 0 6 0 8 0 1 4 0 2 1 1 0 449.59 0.00 211.18 0.48 -1.69 8.81 26.36 0.0 4.29 0.00 1
306795 1 51 27 8323.86 0 4 4 16 1969 0 0 4 4 4 4 0 4 4 0 0 0 0 3 0 0 4 0 4 4 0 4 0 0 0 0 0 308.29 30.00 30.00 16.30 -12.41 -1.02 -4.37 0.0 0.00 0.00 1
271327 1 51 30 8204.49 0 1 4 8 1994 4 0 2 4 0 4 0 2 4 0 1 0 4 0 1 0 3 0 0 0 1 1 1 0 0 1 0 273.48 5.00 5.00 0.00 -7.00 -2.17 -0.73 0.0 0.00 0.00 1
387759 1 51 26 9927.63 0 6 6 30 1957 0 0 5 6 0 0 0 5 6 1 1 0 6 0 1 6 0 0 0 0 1 6 0 5 0 0 0 381.83 0.00 50.32 29.62 -9.50 -0.60 -2.85 0.0 0.00 1.86 1
122026 1 51 29 9951.52 0 4 4 16 1982 0 4 1 4 4 4 0 4 2 3 0 0 0 0 0 0 4 0 0 4 0 1 4 0 2 0 0 343.16 0.00 5.96 0.00 -11.38 -1.92 -0.27 0.0 0.00 0.00 1

Cluster 1 has customers making on-time payments and are loyal. We call this cluster as "Platinum"

Cluster 2:

In [45]:
profile3 = pandas_profiling.ProfileReport(df[df['Customer_Cluster']== 2])
profile3
Out[45]:

Overview

Dataset info

Number of variables 48
Number of observations 19777
Total Missing (%) 0.0%
Total size in memory 7.2 MiB
Average record size in memory 380.0 B

Variables types

Numeric 46
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 1
Unsupported 0

Warnings

Variables

ATT_port_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.16757
Minimum 0
Maximum 6
Zeros (%) 87.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 6
Range 6
Interquartile range 0

Descriptive statistics

Standard deviation 0.49638
Coef of variation 2.9623
Kurtosis 22.436
Mean 0.16757
MAD 0.29142
Skewness 4.0528
Sum 3314
Variance 0.24639
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17197 87.0%
 
1 2069 10.5%
 
2 367 1.9%
 
3 82 0.4%
 
4 49 0.2%
 
5 9 0.0%
 
6 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17197 87.0%
 
1 2069 10.5%
 
2 367 1.9%
 
3 82 0.4%
 
4 49 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
2 367 1.9%
 
3 82 0.4%
 
4 49 0.2%
 
5 9 0.0%
 
6 4 0.0%
 

AndroidDev_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.43146
Minimum 0
Maximum 27
Zeros (%) 65.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 27
Range 27
Interquartile range 1

Descriptive statistics

Standard deviation 0.72572
Coef of variation 1.682
Kurtosis 100.9
Mean 0.43146
MAD 0.56543
Skewness 4.6732
Sum 8533
Variance 0.52667
Memory size 154.6 KiB
Value Count Frequency (%)  
0 12959 65.5%
 
1 5582 28.2%
 
2 946 4.8%
 
3 187 0.9%
 
4 68 0.3%
 
5 20 0.1%
 
6 8 0.0%
 
10 2 0.0%
 
7 2 0.0%
 
27 1 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 12959 65.5%
 
1 5582 28.2%
 
2 946 4.8%
 
3 187 0.9%
 
4 68 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
7 2 0.0%
 
8 1 0.0%
 
9 1 0.0%
 
10 2 0.0%
 
27 1 0.0%
 

BasicDev_cnt
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.10027
Minimum 0
Maximum 5
Zeros (%) 90.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 5
Range 5
Interquartile range 0

Descriptive statistics

Standard deviation 0.32912
Coef of variation 3.2824
Kurtosis 16.531
Mean 0.10027
MAD 0.18203
Skewness 3.6324
Sum 1983
Variance 0.10832
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17952 90.8%
 
1 1682 8.5%
 
2 133 0.7%
 
3 6 0.0%
 
4 3 0.0%
 
5 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17952 90.8%
 
1 1682 8.5%
 
2 133 0.7%
 
3 6 0.0%
 
4 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 1682 8.5%
 
2 133 0.7%
 
3 6 0.0%
 
4 3 0.0%
 
5 1 0.0%
 

BeyondUnlmtd_plan_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.18709
Minimum 0
Maximum 7
Zeros (%) 89.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.67797
Coef of variation 3.6238
Kurtosis 25.92
Mean 0.18709
MAD 0.33628
Skewness 4.734
Sum 3700
Variance 0.45964
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17774 89.9%
 
1 1119 5.7%
 
2 443 2.2%
 
3 192 1.0%
 
4 156 0.8%
 
5 69 0.3%
 
6 18 0.1%
 
7 6 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17774 89.9%
 
1 1119 5.7%
 
2 443 2.2%
 
3 192 1.0%
 
4 156 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
3 192 1.0%
 
4 156 0.8%
 
5 69 0.3%
 
6 18 0.1%
 
7 6 0.0%
 

CA_state_cd
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.77519
Minimum 0
Maximum 10
Zeros (%) 50.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.0869
Coef of variation 1.4022
Kurtosis 7.0095
Mean 0.77519
MAD 0.78574
Skewness 2.2165
Sum 15331
Variance 1.1815
Memory size 154.6 KiB
Value Count Frequency (%)  
0 10023 50.7%
 
1 6658 33.7%
 
2 1785 9.0%
 
3 600 3.0%
 
4 415 2.1%
 
5 199 1.0%
 
6 58 0.3%
 
7 24 0.1%
 
8 7 0.0%
 
10 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10023 50.7%
 
1 6658 33.7%
 
2 1785 9.0%
 
3 600 3.0%
 
4 415 2.1%
 

Maximum 5 values

Value Count Frequency (%)  
6 58 0.3%
 
7 24 0.1%
 
8 7 0.0%
 
9 4 0.0%
 
10 4 0.0%
 

CommunicStore_chnl_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.61
Minimum 0
Maximum 24
Zeros (%) 59.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 24
Range 24
Interquartile range 1

Descriptive statistics

Standard deviation 0.9633
Coef of variation 1.5792
Kurtosis 24.933
Mean 0.61
MAD 0.72459
Skewness 2.9259
Sum 12064
Variance 0.92794
Memory size 154.6 KiB
Value Count Frequency (%)  
0 11746 59.4%
 
1 5566 28.1%
 
2 1564 7.9%
 
3 475 2.4%
 
4 271 1.4%
 
5 116 0.6%
 
6 23 0.1%
 
7 8 0.0%
 
8 4 0.0%
 
12 1 0.0%
 
Other values (3) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 11746 59.4%
 
1 5566 28.1%
 
2 1564 7.9%
 
3 475 2.4%
 
4 271 1.4%
 

Maximum 5 values

Value Count Frequency (%)  
8 4 0.0%
 
9 1 0.0%
 
10 1 0.0%
 
12 1 0.0%
 
24 1 0.0%
 

Customer_Cluster
Constant

This variable is constant and should be ignored for analysis

Constant value 2

DataOnly_Revtype_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.23542
Minimum 0
Maximum 10
Zeros (%) 79.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.51749
Coef of variation 2.1981
Kurtosis 17.227
Mean 0.23542
MAD 0.37524
Skewness 2.9901
Sum 4656
Variance 0.26779
Memory size 154.6 KiB
Value Count Frequency (%)  
0 15761 79.7%
 
1 3519 17.8%
 
2 403 2.0%
 
3 63 0.3%
 
4 20 0.1%
 
5 9 0.0%
 
10 1 0.0%
 
7 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15761 79.7%
 
1 3519 17.8%
 
2 403 2.0%
 
3 63 0.3%
 
4 20 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
3 63 0.3%
 
4 20 0.1%
 
5 9 0.0%
 
7 1 0.0%
 
10 1 0.0%
 

Edge_contract_cnt
Numeric

Distinct count 9
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.65662
Minimum 0
Maximum 9
Zeros (%) 55.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 9
Range 9
Interquartile range 1

Descriptive statistics

Standard deviation 0.95497
Coef of variation 1.4544
Kurtosis 5.7216
Mean 0.65662
MAD 0.72399
Skewness 2.1023
Sum 12986
Variance 0.91197
Memory size 154.6 KiB
Value Count Frequency (%)  
0 10903 55.1%
 
1 6475 32.7%
 
2 1407 7.1%
 
3 471 2.4%
 
4 354 1.8%
 
5 140 0.7%
 
6 23 0.1%
 
7 3 0.0%
 
9 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10903 55.1%
 
1 6475 32.7%
 
2 1407 7.1%
 
3 471 2.4%
 
4 354 1.8%
 

Maximum 5 values

Value Count Frequency (%)  
4 354 1.8%
 
5 140 0.7%
 
6 23 0.1%
 
7 3 0.0%
 
9 1 0.0%
 

FamTeens_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.29767
Minimum 0
Maximum 12
Zeros (%) 80.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 12
Range 12
Interquartile range 0

Descriptive statistics

Standard deviation 0.75452
Coef of variation 2.5348
Kurtosis 20.667
Mean 0.29767
MAD 0.47824
Skewness 3.8501
Sum 5887
Variance 0.56931
Memory size 154.6 KiB
Value Count Frequency (%)  
0 15887 80.3%
 
1 2788 14.1%
 
2 619 3.1%
 
3 231 1.2%
 
4 149 0.8%
 
5 69 0.3%
 
6 19 0.1%
 
7 11 0.1%
 
8 3 0.0%
 
12 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15887 80.3%
 
1 2788 14.1%
 
2 619 3.1%
 
3 231 1.2%
 
4 149 0.8%
 

Maximum 5 values

Value Count Frequency (%)  
5 69 0.3%
 
6 19 0.1%
 
7 11 0.1%
 
8 3 0.0%
 
12 1 0.0%
 

GoUnlimit_plan_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.23477
Minimum 0
Maximum 13
Zeros (%) 86.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 0.73153
Coef of variation 3.116
Kurtosis 26.413
Mean 0.23477
MAD 0.4056
Skewness 4.428
Sum 4643
Variance 0.53514
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17084 86.4%
 
1 1645 8.3%
 
2 557 2.8%
 
3 226 1.1%
 
4 171 0.9%
 
5 66 0.3%
 
6 16 0.1%
 
7 7 0.0%
 
8 3 0.0%
 
13 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17084 86.4%
 
1 1645 8.3%
 
2 557 2.8%
 
3 226 1.1%
 
4 171 0.9%
 

Maximum 5 values

Value Count Frequency (%)  
6 16 0.1%
 
7 7 0.0%
 
8 3 0.0%
 
10 1 0.0%
 
13 1 0.0%
 

Hispanic_ethnicity_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.44461
Minimum 0
Maximum 10
Zeros (%) 73.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.94219
Coef of variation 2.1191
Kurtosis 11.784
Mean 0.44461
MAD 0.65253
Skewness 3.0216
Sum 8793
Variance 0.88772
Memory size 154.6 KiB
Value Count Frequency (%)  
0 14513 73.4%
 
1 3349 16.9%
 
2 1046 5.3%
 
3 407 2.1%
 
4 273 1.4%
 
5 139 0.7%
 
6 25 0.1%
 
7 13 0.1%
 
8 6 0.0%
 
9 5 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14513 73.4%
 
1 3349 16.9%
 
2 1046 5.3%
 
3 407 2.1%
 
4 273 1.4%
 

Maximum 5 values

Value Count Frequency (%)  
6 25 0.1%
 
7 13 0.1%
 
8 6 0.0%
 
9 5 0.0%
 
10 1 0.0%
 

IphoneDev_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.7755
Minimum 0
Maximum 10
Zeros (%) 48.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 1
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.0175
Coef of variation 1.3121
Kurtosis 4.6696
Mean 0.7755
MAD 0.74581
Skewness 1.9158
Sum 15337
Variance 1.0354
Memory size 154.6 KiB
Value Count Frequency (%)  
0 9510 48.1%
 
1 7412 37.5%
 
2 1576 8.0%
 
3 582 2.9%
 
4 497 2.5%
 
5 171 0.9%
 
6 23 0.1%
 
7 4 0.0%
 
10 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 9510 48.1%
 
1 7412 37.5%
 
2 1576 8.0%
 
3 582 2.9%
 
4 497 2.5%
 

Maximum 5 values

Value Count Frequency (%)  
5 171 0.9%
 
6 23 0.1%
 
7 4 0.0%
 
8 1 0.0%
 
10 1 0.0%
 

LocalRetailer_chnl_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.25646
Minimum 0
Maximum 9
Zeros (%) 84.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 9
Range 9
Interquartile range 0

Descriptive statistics

Standard deviation 0.72495
Coef of variation 2.8267
Kurtosis 21.472
Mean 0.25646
MAD 0.43102
Skewness 4.0537
Sum 5072
Variance 0.52555
Memory size 154.6 KiB
Value Count Frequency (%)  
0 16619 84.0%
 
1 2052 10.4%
 
2 665 3.4%
 
3 218 1.1%
 
4 130 0.7%
 
5 62 0.3%
 
6 16 0.1%
 
7 12 0.1%
 
9 2 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 16619 84.0%
 
1 2052 10.4%
 
2 665 3.4%
 
3 218 1.1%
 
4 130 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
5 62 0.3%
 
6 16 0.1%
 
7 12 0.1%
 
8 1 0.0%
 
9 2 0.0%
 

MTM_contract_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1913
Minimum 0
Maximum 30
Zeros (%) 19.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 1
95-th percentile 3
Maximum 30
Range 30
Interquartile range 0

Descriptive statistics

Standard deviation 1.0614
Coef of variation 0.89098
Kurtosis 32.69
Mean 1.1913
MAD 0.68945
Skewness 2.7993
Sum 23560
Variance 1.1266
Memory size 154.6 KiB
Value Count Frequency (%)  
1 11496 58.1%
 
0 3877 19.6%
 
2 2610 13.2%
 
3 809 4.1%
 
4 647 3.3%
 
5 270 1.4%
 
6 46 0.2%
 
7 10 0.1%
 
10 5 0.0%
 
8 4 0.0%
 
Other values (3) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 3877 19.6%
 
1 11496 58.1%
 
2 2610 13.2%
 
3 809 4.1%
 
4 647 3.3%
 

Maximum 5 values

Value Count Frequency (%)  
8 4 0.0%
 
9 1 0.0%
 
10 5 0.0%
 
12 1 0.0%
 
30 1 0.0%
 

MatureAdults_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.33655
Minimum 0
Maximum 9
Zeros (%) 76.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 9
Range 9
Interquartile range 0

Descriptive statistics

Standard deviation 0.72671
Coef of variation 2.1593
Kurtosis 13.61
Mean 0.33655
MAD 0.51209
Skewness 3.1057
Sum 6656
Variance 0.52811
Memory size 154.6 KiB
Value Count Frequency (%)  
0 15046 76.1%
 
1 3488 17.6%
 
2 837 4.2%
 
3 219 1.1%
 
4 126 0.6%
 
5 43 0.2%
 
6 11 0.1%
 
7 5 0.0%
 
9 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15046 76.1%
 
1 3488 17.6%
 
2 837 4.2%
 
3 219 1.1%
 
4 126 0.6%
 

Maximum 5 values

Value Count Frequency (%)  
5 43 0.2%
 
6 11 0.1%
 
7 5 0.0%
 
8 1 0.0%
 
9 1 0.0%
 

NewVZPlan_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.64995
Minimum 0
Maximum 30
Zeros (%) 54.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 30
Range 30
Interquartile range 1

Descriptive statistics

Standard deviation 0.93724
Coef of variation 1.442
Kurtosis 56.178
Mean 0.64995
MAD 0.70368
Skewness 3.6314
Sum 12854
Variance 0.87842
Memory size 154.6 KiB
Value Count Frequency (%)  
0 10706 54.1%
 
1 6652 33.6%
 
2 1642 8.3%
 
3 426 2.2%
 
4 216 1.1%
 
5 91 0.5%
 
6 28 0.1%
 
7 7 0.0%
 
10 4 0.0%
 
9 2 0.0%
 
Other values (2) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10706 54.1%
 
1 6652 33.6%
 
2 1642 8.3%
 
3 426 2.2%
 
4 216 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 7 0.0%
 
8 2 0.0%
 
9 2 0.0%
 
10 4 0.0%
 
30 1 0.0%
 

Purch_equip_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.53244
Minimum 0
Maximum 10
Zeros (%) 56.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.71872
Coef of variation 1.3499
Kurtosis 10.981
Mean 0.53244
MAD 0.59724
Skewness 2.1135
Sum 10530
Variance 0.51656
Memory size 154.6 KiB
Value Count Frequency (%)  
0 11092 56.1%
 
1 7324 37.0%
 
2 1056 5.3%
 
3 192 1.0%
 
4 79 0.4%
 
5 21 0.1%
 
6 5 0.0%
 
7 4 0.0%
 
10 3 0.0%
 
9 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 11092 56.1%
 
1 7324 37.0%
 
2 1056 5.3%
 
3 192 1.0%
 
4 79 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
5 21 0.1%
 
6 5 0.0%
 
7 4 0.0%
 
9 1 0.0%
 
10 3 0.0%
 

SingleLine_plan_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.14648
Minimum 0
Maximum 7
Zeros (%) 86.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.40318
Coef of variation 2.7524
Kurtosis 16.751
Mean 0.14648
MAD 0.25454
Skewness 3.3058
Sum 2897
Variance 0.16255
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17183 86.9%
 
1 2334 11.8%
 
2 233 1.2%
 
3 17 0.1%
 
4 6 0.0%
 
5 3 0.0%
 
7 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17183 86.9%
 
1 2334 11.8%
 
2 233 1.2%
 
3 17 0.1%
 
4 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2 233 1.2%
 
3 17 0.1%
 
4 6 0.0%
 
5 3 0.0%
 
7 1 0.0%
 

SngleCpleKids_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.51737
Minimum 0
Maximum 13
Zeros (%) 67.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 13
Range 13
Interquartile range 1

Descriptive statistics

Standard deviation 0.96564
Coef of variation 1.8664
Kurtosis 11.765
Mean 0.51737
MAD 0.69748
Skewness 2.8363
Sum 10232
Variance 0.93246
Memory size 154.6 KiB
Value Count Frequency (%)  
0 13331 67.4%
 
1 4341 21.9%
 
2 1207 6.1%
 
3 411 2.1%
 
4 292 1.5%
 
5 138 0.7%
 
6 38 0.2%
 
7 10 0.1%
 
10 3 0.0%
 
8 3 0.0%
 
Other values (3) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13331 67.4%
 
1 4341 21.9%
 
2 1207 6.1%
 
3 411 2.1%
 
4 292 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
8 3 0.0%
 
9 1 0.0%
 
10 3 0.0%
 
12 1 0.0%
 
13 1 0.0%
 

Sprint_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.090914
Minimum 0
Maximum 10
Zeros (%) 93.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.4154
Coef of variation 4.5692
Kurtosis 84.423
Mean 0.090914
MAD 0.16994
Skewness 7.4038
Sum 1798
Variance 0.17256
Memory size 154.6 KiB
Value Count Frequency (%)  
0 18484 93.5%
 
1 998 5.0%
 
2 179 0.9%
 
3 63 0.3%
 
4 28 0.1%
 
5 20 0.1%
 
9 2 0.0%
 
10 1 0.0%
 
7 1 0.0%
 
6 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 18484 93.5%
 
1 998 5.0%
 
2 179 0.9%
 
3 63 0.3%
 
4 28 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
5 20 0.1%
 
6 1 0.0%
 
7 1 0.0%
 
9 2 0.0%
 
10 1 0.0%
 

TX_state_cd
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.4173
Minimum 0
Maximum 30
Zeros (%) 73.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 30
Range 30
Interquartile range 1

Descriptive statistics

Standard deviation 0.89103
Coef of variation 2.1352
Kurtosis 73.317
Mean 0.4173
MAD 0.61098
Skewness 4.6752
Sum 8253
Variance 0.79394
Memory size 154.6 KiB
Value Count Frequency (%)  
0 14478 73.2%
 
1 3592 18.2%
 
2 1031 5.2%
 
3 329 1.7%
 
4 212 1.1%
 
5 99 0.5%
 
6 19 0.1%
 
7 11 0.1%
 
8 2 0.0%
 
30 1 0.0%
 
Other values (3) 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14478 73.2%
 
1 3592 18.2%
 
2 1031 5.2%
 
3 329 1.7%
 
4 212 1.1%
 

Maximum 5 values

Value Count Frequency (%)  
8 2 0.0%
 
9 1 0.0%
 
10 1 0.0%
 
13 1 0.0%
 
30 1 0.0%
 

T_Mobile_port_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.16818
Minimum 0
Maximum 6
Zeros (%) 87.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 6
Range 6
Interquartile range 0

Descriptive statistics

Standard deviation 0.51996
Coef of variation 3.0918
Kurtosis 23.73
Mean 0.16818
MAD 0.29441
Skewness 4.2631
Sum 3326
Variance 0.27036
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17311 87.5%
 
1 1898 9.6%
 
2 387 2.0%
 
3 96 0.5%
 
4 60 0.3%
 
5 24 0.1%
 
6 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17311 87.5%
 
1 1898 9.6%
 
2 387 2.0%
 
3 96 0.5%
 
4 60 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
2 387 2.0%
 
3 96 0.5%
 
4 60 0.3%
 
5 24 0.1%
 
6 1 0.0%
 

TwoYear_contract_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.19169
Minimum 0
Maximum 10
Zeros (%) 83.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.46283
Coef of variation 2.4145
Kurtosis 19.591
Mean 0.19169
MAD 0.31867
Skewness 3.1418
Sum 3791
Variance 0.21421
Memory size 154.6 KiB
Value Count Frequency (%)  
0 16439 83.1%
 
1 2972 15.0%
 
2 301 1.5%
 
3 52 0.3%
 
4 9 0.0%
 
5 3 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 16439 83.1%
 
1 2972 15.0%
 
2 301 1.5%
 
3 52 0.3%
 
4 9 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2 301 1.5%
 
3 52 0.3%
 
4 9 0.0%
 
5 3 0.0%
 
10 1 0.0%
 

VerizonUNL_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.4488
Minimum 0
Maximum 12
Zeros (%) 74.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 12
Range 12
Interquartile range 1

Descriptive statistics

Standard deviation 0.97466
Coef of variation 2.1717
Kurtosis 12.209
Mean 0.4488
MAD 0.6655
Skewness 3.0834
Sum 8876
Variance 0.94996
Memory size 154.6 KiB
Value Count Frequency (%)  
0 14663 74.1%
 
1 3133 15.8%
 
2 1058 5.3%
 
3 405 2.0%
 
4 294 1.5%
 
5 160 0.8%
 
6 34 0.2%
 
7 17 0.1%
 
8 8 0.0%
 
9 3 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14663 74.1%
 
1 3133 15.8%
 
2 1058 5.3%
 
3 405 2.0%
 
4 294 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
7 17 0.1%
 
8 8 0.0%
 
9 3 0.0%
 
10 1 0.0%
 
12 1 0.0%
 

VoiceOnly_pplangrp_cnt
Numeric

Distinct count 7
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.14689
Minimum 0
Maximum 7
Zeros (%) 86.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.40391
Coef of variation 2.7498
Kurtosis 17.201
Mean 0.14689
MAD 0.25515
Skewness 3.3236
Sum 2905
Variance 0.16314
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17177 86.9%
 
1 2336 11.8%
 
2 241 1.2%
 
3 12 0.1%
 
4 6 0.0%
 
5 4 0.0%
 
7 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17177 86.9%
 
1 2336 11.8%
 
2 241 1.2%
 
3 12 0.1%
 
4 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2 241 1.2%
 
3 12 0.1%
 
4 6 0.0%
 
5 4 0.0%
 
7 1 0.0%
 

WearablDev_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.05198
Minimum 0
Maximum 4
Zeros (%) 95.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.26498
Coef of variation 5.0978
Kurtosis 42.682
Mean 0.05198
MAD 0.099523
Skewness 5.998
Sum 1028
Variance 0.070215
Memory size 154.6 KiB
Value Count Frequency (%)  
0 18933 95.7%
 
1 680 3.4%
 
2 147 0.7%
 
3 14 0.1%
 
4 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 18933 95.7%
 
1 680 3.4%
 
2 147 0.7%
 
3 14 0.1%
 
4 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 18933 95.7%
 
1 680 3.4%
 
2 147 0.7%
 
3 14 0.1%
 
4 3 0.0%
 

access_rev_acrtn
Numeric

Distinct count 2520
Unique (%) 12.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -7.1152
Minimum -266.5
Maximum 106.47
Zeros (%) 40.6%

Quantile statistics

Minimum -266.5
5-th percentile -34.372
Q1 -8.71
Median -1.13
Q3 0
95-th percentile 0
Maximum 106.47
Range 372.97
Interquartile range 8.71

Descriptive statistics

Standard deviation 14.153
Coef of variation -1.9891
Kurtosis 22.427
Mean -7.1152
MAD 8.7665
Skewness -3.5904
Sum -140720
Variance 200.3
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 8035 40.6%
 
-10.0 307 1.6%
 
-5.0 294 1.5%
 
-16.25 236 1.2%
 
-8.75 187 0.9%
 
-10.83 152 0.8%
 
-27.5 146 0.7%
 
-6.67 138 0.7%
 
-13.0 135 0.7%
 
-8.0 128 0.6%
 
Other values (2510) 10019 50.7%
 

Minimum 5 values

Value Count Frequency (%)  
-266.5 1 0.0%
 
-210.0 1 0.0%
 
-177.42 1 0.0%
 
-159.4 1 0.0%
 
-151.7 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
42.75 1 0.0%
 
43.33 1 0.0%
 
60.0 1 0.0%
 
75.0 1 0.0%
 
106.47 1 0.0%
 

active_loan_cnt
Numeric

Distinct count 36
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.8376
Minimum 0
Maximum 108
Zeros (%) 49.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 1
95-th percentile 9
Maximum 108
Range 108
Interquartile range 1

Descriptive statistics

Standard deviation 4.348
Coef of variation 2.3661
Kurtosis 49.238
Mean 1.8376
MAD 2.3062
Skewness 5.2278
Sum 36343
Variance 18.905
Memory size 154.6 KiB
Value Count Frequency (%)  
0 9745 49.3%
 
1 5846 29.6%
 
4 1274 6.4%
 
2 1005 5.1%
 
9 421 2.1%
 
16 367 1.9%
 
6 290 1.5%
 
3 190 1.0%
 
25 121 0.6%
 
12 113 0.6%
 
Other values (26) 405 2.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 9745 49.3%
 
1 5846 29.6%
 
2 1005 5.1%
 
3 190 1.0%
 
4 1274 6.4%
 

Maximum 5 values

Value Count Frequency (%)  
50 1 0.0%
 
56 1 0.0%
 
60 1 0.0%
 
91 1 0.0%
 
108 1 0.0%
 

arpu_amt
Numeric

Distinct count 12110
Unique (%) 61.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 91.849
Minimum -1857.8
Maximum 1633.8
Zeros (%) 4.6%

Quantile statistics

Minimum -1857.8
5-th percentile 0
Q1 41.8
Median 74.19
Q3 117.22
95-th percentile 248.09
Maximum 1633.8
Range 3491.6
Interquartile range 75.42

Descriptive statistics

Standard deviation 91.223
Coef of variation 0.99318
Kurtosis 26.881
Mean 91.849
MAD 58.425
Skewness 1.7897
Sum 1816500
Variance 8321.6
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 914 4.6%
 
15.0 43 0.2%
 
1.02 19 0.1%
 
0.94 16 0.1%
 
0.92 11 0.1%
 
1.0 11 0.1%
 
60.97 10 0.1%
 
0.96 8 0.0%
 
79.16 8 0.0%
 
1.06 8 0.0%
 
Other values (12100) 18729 94.7%
 

Minimum 5 values

Value Count Frequency (%)  
-1857.8 1 0.0%
 
-1057.14 1 0.0%
 
-809.6 1 0.0%
 
-596.04 1 0.0%
 
-473.33 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
860.22 1 0.0%
 
861.61 1 0.0%
 
880.85 1 0.0%
 
885.46 1 0.0%
 
1633.79 1 0.0%
 

billed_acct_size
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.391
Minimum 0
Maximum 12
Zeros (%) 8.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 1
Q3 2
95-th percentile 4
Maximum 12
Range 12
Interquartile range 1

Descriptive statistics

Standard deviation 1.028
Coef of variation 0.73906
Kurtosis 7.1592
Mean 1.391
MAD 0.72887
Skewness 2.1334
Sum 27509
Variance 1.0568
Memory size 154.6 KiB
Value Count Frequency (%)  
1 12636 63.9%
 
2 3339 16.9%
 
0 1630 8.2%
 
3 1088 5.5%
 
4 680 3.4%
 
5 292 1.5%
 
6 68 0.3%
 
7 28 0.1%
 
8 7 0.0%
 
10 4 0.0%
 
Other values (2) 5 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 1630 8.2%
 
1 12636 63.9%
 
2 3339 16.9%
 
3 1088 5.5%
 
4 680 3.4%
 

Maximum 5 values

Value Count Frequency (%)  
7 28 0.1%
 
8 7 0.0%
 
9 3 0.0%
 
10 4 0.0%
 
12 2 0.0%
 

birth_year
Numeric

Distinct count 88
Unique (%) 0.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1971.2
Minimum 0
Maximum 2000
Zeros (%) 0.1%

Quantile statistics

Minimum 0
5-th percentile 1940
Q1 1960
Median 1976
Q3 1988
95-th percentile 1997
Maximum 2000
Range 2000
Interquartile range 28

Descriptive statistics

Standard deviation 60.573
Coef of variation 0.030729
Kurtosis 961.39
Mean 1971.2
MAD 17.1
Skewness -29.635
Sum 38984232
Variance 3669.1
Memory size 154.6 KiB
Value Count Frequency (%)  
1990 488 2.5%
 
1988 475 2.4%
 
1989 470 2.4%
 
1985 468 2.4%
 
1997 466 2.4%
 
1992 455 2.3%
 
1991 454 2.3%
 
1995 449 2.3%
 
1993 449 2.3%
 
1994 440 2.2%
 
Other values (78) 15163 76.7%
 

Minimum 5 values

Value Count Frequency (%)  
0 17 0.1%
 
1911 2 0.0%
 
1912 3 0.0%
 
1915 1 0.0%
 
1917 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1996 410 2.1%
 
1997 466 2.4%
 
1998 438 2.2%
 
1999 241 1.2%
 
2000 1 0.0%
 

cust_status_ind
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.17101
0
16395
1
3382
Value Count Frequency (%)  
0 16395 82.9%
 
1 3382 17.1%
 

data_overage_amt
Numeric

Distinct count 499
Unique (%) 2.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.1509
Minimum 0
Maximum 1515
Zeros (%) 86.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 7.5
Maximum 1515
Range 1515
Interquartile range 0

Descriptive statistics

Standard deviation 18.845
Coef of variation 8.7614
Kurtosis 2407.7
Mean 2.1509
MAD 3.8134
Skewness 37.778
Sum 42539
Variance 355.13
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 17088 86.4%
 
7.5 145 0.7%
 
15.0 121 0.6%
 
5.0 102 0.5%
 
3.75 87 0.4%
 
2.5 68 0.3%
 
3.0 57 0.3%
 
2.14 53 0.3%
 
10.0 48 0.2%
 
0.75 45 0.2%
 
Other values (489) 1963 9.9%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 17088 86.4%
 
0.01 6 0.0%
 
0.02 3 0.0%
 
0.03 3 0.0%
 
0.04 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
397.5 1 0.0%
 
400.0 1 0.0%
 
705.0 1 0.0%
 
750.0 1 0.0%
 
1515.0 1 0.0%
 

data_ovrg_Yes_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.1434
Minimum 0
Maximum 4
Zeros (%) 86.3%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.37136
Coef of variation 2.5897
Kurtosis 6.7238
Mean 0.1434
MAD 0.24763
Skewness 2.5559
Sum 2836
Variance 0.13791
Memory size 154.6 KiB
Value Count Frequency (%)  
0 17076 86.3%
 
1 2579 13.0%
 
2 110 0.6%
 
3 11 0.1%
 
4 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 17076 86.3%
 
1 2579 13.0%
 
2 110 0.6%
 
3 11 0.1%
 
4 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 17076 86.3%
 
1 2579 13.0%
 
2 110 0.6%
 
3 11 0.1%
 
4 1 0.0%
 

eqp_rev_acrtn
Numeric

Distinct count 2782
Unique (%) 14.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -5.9431
Minimum -456
Maximum 375
Zeros (%) 52.2%

Quantile statistics

Minimum -456
5-th percentile -33.822
Q1 -3.04
Median 0
Q3 0
95-th percentile 0
Maximum 375
Range 831
Interquartile range 3.04

Descriptive statistics

Standard deviation 23.728
Coef of variation -3.9925
Kurtosis 85.04
Mean -5.9431
MAD 9.5077
Skewness -6.0725
Sum -117540
Variance 563
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 10316 52.2%
 
-0.01 227 1.1%
 
-0.02 159 0.8%
 
-0.04 116 0.6%
 
-0.08 105 0.5%
 
-0.03 71 0.4%
 
-0.12 60 0.3%
 
-0.06 56 0.3%
 
-0.05 53 0.3%
 
-1.0 42 0.2%
 
Other values (2772) 8572 43.3%
 

Minimum 5 values

Value Count Frequency (%)  
-456.0 1 0.0%
 
-450.0 1 0.0%
 
-435.01 1 0.0%
 
-434.41 1 0.0%
 
-359.88 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
218.95 1 0.0%
 
229.35 1 0.0%
 
233.27 1 0.0%
 
322.7 1 0.0%
 
375.0 2 0.0%
 

frequency
Numeric

Distinct count 31
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 9.3389
Minimum 1
Maximum 31
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 1
Q1 3
Median 6
Q3 14
95-th percentile 26
Maximum 31
Range 30
Interquartile range 11

Descriptive statistics

Standard deviation 7.9967
Coef of variation 0.85628
Kurtosis -0.22969
Mean 9.3389
MAD 6.639
Skewness 0.9908
Sum 184695
Variance 63.947
Memory size 154.6 KiB
Value Count Frequency (%)  
2 2995 15.1%
 
4 2224 11.2%
 
3 1587 8.0%
 
5 1220 6.2%
 
1 1134 5.7%
 
6 1047 5.3%
 
7 934 4.7%
 
8 792 4.0%
 
9 725 3.7%
 
20 639 3.2%
 
Other values (21) 6480 32.8%
 

Minimum 5 values

Value Count Frequency (%)  
1 1134 5.7%
 
2 2995 15.1%
 
3 1587 8.0%
 
4 2224 11.2%
 
5 1220 6.2%
 

Maximum 5 values

Value Count Frequency (%)  
27 205 1.0%
 
28 163 0.8%
 
29 142 0.7%
 
30 112 0.6%
 
31 105 0.5%
 

gl_late_fees_amt
Numeric

Distinct count 1312
Unique (%) 6.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 9.9283
Minimum -48
Maximum 135
Zeros (%) 42.2%

Quantile statistics

Minimum -48
5-th percentile 0
Q1 0
Median 5
Q3 15
95-th percentile 40
Maximum 135
Range 183
Interquartile range 15

Descriptive statistics

Standard deviation 14.352
Coef of variation 1.4456
Kurtosis 8.4626
Mean 9.9283
MAD 10.065
Skewness 2.4318
Sum 196350
Variance 205.98
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 8341 42.2%
 
5.0 3148 15.9%
 
10.0 2255 11.4%
 
15.0 1165 5.9%
 
20.0 841 4.3%
 
25.0 603 3.0%
 
30.0 411 2.1%
 
35.0 308 1.6%
 
40.0 230 1.2%
 
45.0 179 0.9%
 
Other values (1302) 2296 11.6%
 

Minimum 5 values

Value Count Frequency (%)  
-48.0 1 0.0%
 
-45.0 1 0.0%
 
-25.11 1 0.0%
 
-22.0 1 0.0%
 
-15.0 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
118.51 1 0.0%
 
120.0 2 0.0%
 
125.0 2 0.0%
 
130.0 1 0.0%
 
135.0 3 0.0%
 

gl_nonrecur_rev_amt
Numeric

Distinct count 4899
Unique (%) 24.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 45.222
Minimum -2991.1
Maximum 1515
Zeros (%) 26.2%

Quantile statistics

Minimum -2991.1
5-th percentile 0
Q1 0
Median 10
Q3 50.35
95-th percentile 220.24
Maximum 1515
Range 4506.1
Interquartile range 50.35

Descriptive statistics

Standard deviation 107.07
Coef of variation 2.3676
Kurtosis 66.911
Mean 45.222
MAD 60.377
Skewness 1.1038
Sum 894350
Variance 11463
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 5174 26.2%
 
5.0 1658 8.4%
 
10.0 1175 5.9%
 
15.0 711 3.6%
 
20.0 393 2.0%
 
30.0 339 1.7%
 
25.0 334 1.7%
 
35.0 252 1.3%
 
40.0 201 1.0%
 
45.0 180 0.9%
 
Other values (4889) 9360 47.3%
 

Minimum 5 values

Value Count Frequency (%)  
-2991.08 1 0.0%
 
-1857.8 1 0.0%
 
-1592.0 1 0.0%
 
-1251.28 1 0.0%
 
-1242.11 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1407.28 1 0.0%
 
1440.0 1 0.0%
 
1486.03 1 0.0%
 
1505.0 1 0.0%
 
1515.0 1 0.0%
 

handset_paychnl_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.2554
Minimum 0
Maximum 13
Zeros (%) 81.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 0.64897
Coef of variation 2.541
Kurtosis 27.161
Mean 0.2554
MAD 0.41831
Skewness 3.9091
Sum 5051
Variance 0.42117
Memory size 154.6 KiB
Value Count Frequency (%)  
0 16196 81.9%
 
1 2596 13.1%
 
2 680 3.4%
 
3 195 1.0%
 
4 67 0.3%
 
5 32 0.2%
 
6 6 0.0%
 
7 3 0.0%
 
13 1 0.0%
 
12 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 16196 81.9%
 
1 2596 13.1%
 
2 680 3.4%
 
3 195 1.0%
 
4 67 0.3%
 

Maximum 5 values

Value Count Frequency (%)  
5 32 0.2%
 
6 6 0.0%
 
7 3 0.0%
 
12 1 0.0%
 
13 1 0.0%
 

index
Numeric

Distinct count 19777
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 380340
Minimum 9
Maximum 758177
Zeros (%) 0.0%

Quantile statistics

Minimum 9
5-th percentile 37444
Q1 189170
Median 381390
Q3 572020
95-th percentile 720800
Maximum 758177
Range 758168
Interquartile range 382850

Descriptive statistics

Standard deviation 219720
Coef of variation 0.57769
Kurtosis -1.2125
Mean 380340
MAD 190580
Skewness -0.0034771
Sum 7521936736
Variance 48276000000
Memory size 154.6 KiB
Value Count Frequency (%)  
1 464893 2350.7%
 
1 627376 3172.3%
 
1 737917 3731.2%
 
1 198693 1004.7%
 
1 10880 55.0%
 
1 241850 1222.9%
 
1 211586 1069.9%
 
1 2692 13.6%
 
1 326043 1648.6%
 
1 680585 3441.3%
 
Other values (19767) 7518434221 38016050.1%
 

Minimum 5 values

Value Count Frequency (%)  
1 464893 2350.7%
 
1 627376 3172.3%
 
1 737917 3731.2%
 
1 198693 1004.7%
 
1 10880 55.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 477901 2416.4%
 
1 601343 3040.6%
 
1 218368 1104.2%
 
1 214259 1083.4%
 
1 131072 662.7%
 

monetary
Numeric

Distinct count 17627
Unique (%) 89.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 705.89
Minimum -3238.4
Maximum 1851.2
Zeros (%) 4.6%

Quantile statistics

Minimum -3238.4
5-th percentile 0
Q1 222.45
Median 617.36
Q3 1180.3
95-th percentile 1635.9
Maximum 1851.2
Range 5089.6
Interquartile range 957.81

Descriptive statistics

Standard deviation 548.88
Coef of variation 0.77757
Kurtosis -0.86975
Mean 705.89
MAD 475.58
Skewness 0.25934
Sum 13960000
Variance 301270
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 914 4.6%
 
15.0 42 0.2%
 
1.85 12 0.1%
 
2.01 10 0.1%
 
30.0 9 0.0%
 
1.88 9 0.0%
 
-32.92 8 0.0%
 
1.91 8 0.0%
 
1.89 7 0.0%
 
2.03 7 0.0%
 
Other values (17617) 18751 94.8%
 

Minimum 5 values

Value Count Frequency (%)  
-3238.4 1 0.0%
 
-2114.29 1 0.0%
 
-2092.43 1 0.0%
 
-1857.8 1 0.0%
 
-1756.38 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1823.2 1 0.0%
 
1827.67 1 0.0%
 
1830.76 1 0.0%
 
1833.37 1 0.0%
 
1851.24 1 0.0%
 

pplan_access_acrtn
Numeric

Distinct count 602
Unique (%) 3.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 20.885
Minimum -120
Maximum 1893.3
Zeros (%) 20.9%

Quantile statistics

Minimum -120
5-th percentile 0
Q1 0.75
Median 5
Q3 20
95-th percentile 110
Maximum 1893.3
Range 2013.3
Interquartile range 19.25

Descriptive statistics

Standard deviation 47.443
Coef of variation 2.2716
Kurtosis 160.93
Mean 20.885
MAD 24.797
Skewness 7.7961
Sum 413050
Variance 2250.9
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 4139 20.9%
 
20.0 506 2.6%
 
10.0 451 2.3%
 
32.5 389 2.0%
 
5.0 388 2.0%
 
17.5 350 1.8%
 
35.0 345 1.7%
 
110.0 312 1.6%
 
16.25 298 1.5%
 
40.0 253 1.3%
 
Other values (592) 12346 62.4%
 

Minimum 5 values

Value Count Frequency (%)  
-120.0 1 0.0%
 
0.0 4139 20.9%
 
0.16 35 0.2%
 
0.17 55 0.3%
 
0.18 9 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
660.0 1 0.0%
 
720.0 2 0.0%
 
880.0 1 0.0%
 
900.0 1 0.0%
 
1893.33 1 0.0%
 

recency
Numeric

Distinct count 31
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 521.77
Minimum 51
Maximum 962
Zeros (%) 0.0%

Quantile statistics

Minimum 51
5-th percentile 51
Q1 354
Median 597
Q3 719
95-th percentile 872
Maximum 962
Range 911
Interquartile range 365

Descriptive statistics

Standard deviation 263.79
Coef of variation 0.50556
Kurtosis -0.73918
Mean 521.77
MAD 216.6
Skewness -0.63051
Sum 10318984
Variance 69584
Memory size 154.6 KiB
Value Count Frequency (%)  
51 3163 16.0%
 
597 1322 6.7%
 
628 1302 6.6%
 
658 1296 6.6%
 
689 1049 5.3%
 
566 1043 5.3%
 
719 973 4.9%
 
750 960 4.9%
 
538 898 4.5%
 
781 799 4.0%
 
Other values (21) 6972 35.3%
 

Minimum 5 values

Value Count Frequency (%)  
51 3163 16.0%
 
81 76 0.4%
 
112 95 0.5%
 
142 109 0.6%
 
173 133 0.7%
 

Maximum 5 values

Value Count Frequency (%)  
842 645 3.3%
 
872 563 2.8%
 
903 345 1.7%
 
931 295 1.5%
 
962 25 0.1%
 

recurring_paychnl_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.3412
Minimum 0
Maximum 10
Zeros (%) 73.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.6647
Coef of variation 1.9481
Kurtosis 12.967
Mean 0.3412
MAD 0.5006
Skewness 2.7959
Sum 6748
Variance 0.44183
Memory size 154.6 KiB
Value Count Frequency (%)  
0 14508 73.4%
 
1 4213 21.3%
 
2 791 4.0%
 
3 159 0.8%
 
4 73 0.4%
 
5 22 0.1%
 
6 7 0.0%
 
7 2 0.0%
 
10 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 14508 73.4%
 
1 4213 21.3%
 
2 791 4.0%
 
3 159 0.8%
 
4 73 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
5 22 0.1%
 
6 7 0.0%
 
7 2 0.0%
 
8 1 0.0%
 
10 1 0.0%
 

tot_revenue_acrtn
Numeric

Distinct count 8243
Unique (%) 41.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -30.722
Minimum -628.74
Maximum 704.12
Zeros (%) 7.9%

Quantile statistics

Minimum -628.74
5-th percentile -142.1
Q1 -35.66
Median -11.5
Q3 -1.33
95-th percentile 2.724
Maximum 704.12
Range 1332.9
Interquartile range 34.33

Descriptive statistics

Standard deviation 60.58
Coef of variation -1.9719
Kurtosis 19.761
Mean -30.722
MAD 36.035
Skewness -2.4221
Sum -607580
Variance 3669.9
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 1572 7.9%
 
0.01 268 1.4%
 
0.02 46 0.2%
 
0.03 37 0.2%
 
-1.64 34 0.2%
 
-1.36 28 0.1%
 
-1.56 27 0.1%
 
-1.5 26 0.1%
 
-1.74 24 0.1%
 
-1.42 23 0.1%
 
Other values (8233) 17692 89.5%
 

Minimum 5 values

Value Count Frequency (%)  
-628.74 1 0.0%
 
-612.8 1 0.0%
 
-612.56 1 0.0%
 
-601.31 1 0.0%
 
-585.84 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
535.35 1 0.0%
 
598.35 1 0.0%
 
638.32 1 0.0%
 
703.72 1 0.0%
 
704.12 1 0.0%
 

usage_rev_acrtn
Numeric

Distinct count 577
Unique (%) 2.9%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.1153
Minimum -200
Maximum 750
Zeros (%) 92.7%

Quantile statistics

Minimum -200
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.41
Maximum 750
Range 950
Interquartile range 0

Descriptive statistics

Standard deviation 16.748
Coef of variation 15.017
Kurtosis 782.25
Mean 1.1153
MAD 2.3084
Skewness 24.666
Sum 22057
Variance 280.51
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 18326 92.7%
 
7.5 64 0.3%
 
5.0 36 0.2%
 
0.1 33 0.2%
 
0.5 32 0.2%
 
15.0 29 0.1%
 
1.0 26 0.1%
 
0.9 24 0.1%
 
0.66 18 0.1%
 
3.75 18 0.1%
 
Other values (567) 1171 5.9%
 

Minimum 5 values

Value Count Frequency (%)  
-200.0 1 0.0%
 
-154.0 1 0.0%
 
-90.0 1 0.0%
 
-71.12 1 0.0%
 
-68.0 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
548.66 1 0.0%
 
571.46 1 0.0%
 
575.19 1 0.0%
 
705.0 1 0.0%
 
750.0 1 0.0%
 

voice_overage_amt
Numeric

Distinct count 397
Unique (%) 2.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.37933
Minimum 0
Maximum 548.66
Zeros (%) 96.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 548.66
Range 548.66
Interquartile range 0

Descriptive statistics

Standard deviation 7.0169
Coef of variation 18.498
Kurtosis 3593.9
Mean 0.37933
MAD 0.73318
Skewness 52.186
Sum 7502.1
Variance 49.237
Memory size 154.6 KiB
Value Count Frequency (%)  
0.0 19028 96.2%
 
0.9 28 0.1%
 
1.79 18 0.1%
 
0.14 13 0.1%
 
0.6 12 0.1%
 
0.26 12 0.1%
 
0.08 10 0.1%
 
2.68 10 0.1%
 
0.09 9 0.0%
 
0.45 8 0.0%
 
Other values (387) 629 3.2%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 19028 96.2%
 
0.02 1 0.0%
 
0.04 6 0.0%
 
0.05 1 0.0%
 
0.07 8 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
165.58 1 0.0%
 
213.01 1 0.0%
 
228.74 1 0.0%
 
521.76 1 0.0%
 
548.66 1 0.0%
 

Correlations

Sample

cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
50842 0 324 11 1474.57 2 0 3 6 1956 3 2 0 3 3 0 2 2 0 1 0 0 0 0 2 3 2 0 0 0 0 1 0 1 0 0 0 134.05 25.0 145.38 0.00 -22.20 12.09 -13.08 0.0 4.09 0.0 2
344405 0 507 16 1117.69 1 0 1 0 1968 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 69.86 15.0 15.08 2.19 -3.91 -2.19 0.00 0.0 0.00 0.0 2
531372 0 416 11 1055.67 1 0 1 1 1999 0 1 1 1 0 0 0 1 0 0 0 0 1 0 0 0 1 0 0 0 0 1 0 0 0 0 0 95.97 20.0 20.00 3.64 -14.10 0.00 -1.98 0.0 0.00 0.0 2
649366 0 658 1 0.00 0 0 0 0 1983 1 0 1 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0.00 0.0 0.00 0.00 0.00 0.00 0.00 0.0 0.00 0.0 2
218083 0 293 13 1514.10 0 0 2 2 1996 2 0 1 2 0 0 0 1 0 0 0 0 2 1 2 0 0 0 0 0 1 1 0 0 2 1 0 116.47 0.0 0.00 0.00 -18.65 0.00 -1.63 0.0 0.00 0.0 2

Cluster 2 - Bronze Findings:

1. Recency - Very Low (Mean - 524.16)
2. Frequency - Very Low (Mean - 9.2962)
3. Monetary - Very Low (Mean - 702.76)
6. IphoneDev_cnt - 51.9%
10. ARPU Amount - 91.445 (Mean)
11. billed_acct_size - 1.3875 (Mean)
12. Birth Year - 1971 (Mean)
13. % of AT&T port in - 13.1%
14. % of Sprint_port in - 6.5%
15. % of T mobile port in - 12.2%
16. % of Android Devices - 34%
17. CA users - 49.1%
18. TX users - 27.1%
19. CommunicStore_chnl_cnt - 40.2%
20. Edge_contract_cnt - 44.2% 
22. SingleLine_plan_cnt - 13.2%
23. data_ovrg_Yes_cnt - 13.4%
25. Hispanic Ethnicity - 26.7%
27. Purch_equip_cnt - 43.5%
29. WearablDev_cnt - 3.2%
30. active_loan_cnt - 50%
31. Total Revenue Acrtn - -30.913(Mean)
In [46]:
df[df['Customer_Cluster']== 2].head(10)
Out[46]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
50842 0 324 11 1474.57 2 0 3 6 1956 3 2 0 3 3 0 2 2 0 1 0 0 0 0 2 3 2 0 0 0 0 1 0 1 0 0 0 134.05 25.0 145.38 0.00 -22.20 12.09 -13.08 0.0 4.09 0.0 2
344405 0 507 16 1117.69 1 0 1 0 1968 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 69.86 15.0 15.08 2.19 -3.91 -2.19 0.00 0.0 0.00 0.0 2
531372 0 416 11 1055.67 1 0 1 1 1999 0 1 1 1 0 0 0 1 0 0 0 0 1 0 0 0 1 0 0 0 0 1 0 0 0 0 0 95.97 20.0 20.00 3.64 -14.10 0.00 -1.98 0.0 0.00 0.0 2
649366 0 658 1 0.00 0 0 0 0 1983 1 0 1 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0.00 0.0 0.00 0.00 0.00 0.00 0.00 0.0 0.00 0.0 2
218083 0 293 13 1514.10 0 0 2 2 1996 2 0 1 2 0 0 0 1 0 0 0 0 2 1 2 0 0 0 0 0 1 1 0 0 2 1 0 116.47 0.0 0.00 0.00 -18.65 0.00 -1.63 0.0 0.00 0.0 2
111340 0 263 16 1414.54 0 0 1 0 1993 1 0 1 1 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 0 0 88.41 5.0 5.00 0.00 -5.73 0.00 0.00 0.0 0.00 0.0 2
350378 0 658 2 172.45 1 0 1 0 1982 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 1 0 0 1 0 0 0 0 0 86.22 5.0 5.00 20.00 -17.44 0.00 0.00 0.0 0.00 0.0 2
215561 0 689 3 179.94 1 0 1 1 1990 1 0 0 0 0 1 0 1 0 0 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 0 0 59.98 0.0 0.00 13.33 -66.29 -13.33 -4.66 0.0 0.00 0.0 2
499436 0 597 4 189.99 0 0 1 0 1988 0 0 0 0 0 0 1 0 1 0 0 0 0 0 1 0 0 0 1 0 1 1 0 0 0 0 0 47.50 10.0 40.20 12.50 -25.02 -12.50 0.00 0.0 7.50 0.0 2
602875 0 903 2 342.04 2 0 2 0 1980 0 0 2 0 0 2 0 0 2 0 0 0 2 0 0 0 0 0 0 0 1 2 0 0 0 0 0 171.02 5.0 5.00 80.00 -34.56 0.00 0.00 0.0 0.00 0.0 2

Cluster 2 contains customers who are either churned or are having trouble paying their monthly phone bills. We can name this cluster as "bronze"

Cluster 3:

In [47]:
profile4 = pandas_profiling.ProfileReport(df[df['Customer_Cluster']== 3])
profile4
Out[47]:

Overview

Dataset info

Number of variables 48
Number of observations 16041
Total Missing (%) 0.0%
Total size in memory 5.8 MiB
Average record size in memory 380.0 B

Variables types

Numeric 45
Categorical 0
Boolean 1
Date 0
Text (Unique) 0
Rejected 2
Unsupported 0

Warnings

Variables

ATT_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.52634
Minimum 0
Maximum 10
Zeros (%) 70.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 0.96352
Coef of variation 1.8306
Kurtosis 4.4422
Mean 0.52634
MAD 0.74634
Skewness 2.0353
Sum 8443
Variance 0.92837
Memory size 125.4 KiB
Value Count Frequency (%)  
0 11373 70.9%
 
1 2080 13.0%
 
2 1777 11.1%
 
3 509 3.2%
 
4 248 1.5%
 
5 41 0.3%
 
6 10 0.1%
 
7 1 0.0%
 
10 1 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 11373 70.9%
 
1 2080 13.0%
 
2 1777 11.1%
 
3 509 3.2%
 
4 248 1.5%
 

Maximum 5 values

Value Count Frequency (%)  
5 41 0.3%
 
6 10 0.1%
 
7 1 0.0%
 
8 1 0.0%
 
10 1 0.0%
 

AndroidDev_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.87195
Minimum 0
Maximum 11
Zeros (%) 50.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 3
Maximum 11
Range 11
Interquartile range 2

Descriptive statistics

Standard deviation 1.0925
Coef of variation 1.2529
Kurtosis 2.2467
Mean 0.87195
MAD 0.88038
Skewness 1.3394
Sum 13987
Variance 1.1935
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8098 50.5%
 
1 3750 23.4%
 
2 2904 18.1%
 
3 867 5.4%
 
4 324 2.0%
 
5 74 0.5%
 
6 17 0.1%
 
8 3 0.0%
 
9 2 0.0%
 
7 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8098 50.5%
 
1 3750 23.4%
 
2 2904 18.1%
 
3 867 5.4%
 
4 324 2.0%
 

Maximum 5 values

Value Count Frequency (%)  
6 17 0.1%
 
7 1 0.0%
 
8 3 0.0%
 
9 2 0.0%
 
11 1 0.0%
 

BasicDev_cnt
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.045758
Minimum 0
Maximum 5
Zeros (%) 96.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 5
Range 5
Interquartile range 0

Descriptive statistics

Standard deviation 0.2447
Coef of variation 5.3476
Kurtosis 63.879
Mean 0.045758
MAD 0.08791
Skewness 6.8203
Sum 734
Variance 0.059876
Memory size 125.4 KiB
Value Count Frequency (%)  
0 15409 96.1%
 
1 551 3.4%
 
2 65 0.4%
 
3 13 0.1%
 
5 2 0.0%
 
4 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15409 96.1%
 
1 551 3.4%
 
2 65 0.4%
 
3 13 0.1%
 
4 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 551 3.4%
 
2 65 0.4%
 
3 13 0.1%
 
4 1 0.0%
 
5 2 0.0%
 

BeyondUnlmtd_plan_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.43258
Minimum 0
Maximum 10
Zeros (%) 84.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 3
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 1.1133
Coef of variation 2.5736
Kurtosis 8.6131
Mean 0.43258
MAD 0.72806
Skewness 2.8431
Sum 6939
Variance 1.2394
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13499 84.2%
 
2 1050 6.5%
 
3 651 4.1%
 
4 349 2.2%
 
1 273 1.7%
 
5 139 0.9%
 
6 58 0.4%
 
7 11 0.1%
 
8 5 0.0%
 
10 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13499 84.2%
 
1 273 1.7%
 
2 1050 6.5%
 
3 651 4.1%
 
4 349 2.2%
 

Maximum 5 values

Value Count Frequency (%)  
6 58 0.4%
 
7 11 0.1%
 
8 5 0.0%
 
9 3 0.0%
 
10 3 0.0%
 

CA_state_cd
Numeric

Distinct count 14
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.4118
Minimum 0
Maximum 13
Zeros (%) 53.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 3
95-th percentile 5
Maximum 13
Range 13
Interquartile range 3

Descriptive statistics

Standard deviation 1.7492
Coef of variation 1.239
Kurtosis 0.6809
Mean 1.4118
MAD 1.5214
Skewness 1.0464
Sum 22646
Variance 3.0596
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8513 53.1%
 
2 2719 17.0%
 
3 2112 13.2%
 
4 1362 8.5%
 
5 557 3.5%
 
1 448 2.8%
 
6 220 1.4%
 
7 59 0.4%
 
8 24 0.1%
 
9 12 0.1%
 
Other values (4) 15 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 8513 53.1%
 
1 448 2.8%
 
2 2719 17.0%
 
3 2112 13.2%
 
4 1362 8.5%
 

Maximum 5 values

Value Count Frequency (%)  
9 12 0.1%
 
10 11 0.1%
 
11 1 0.0%
 
12 2 0.0%
 
13 1 0.0%
 

CommunicStore_chnl_cnt
Numeric

Distinct count 14
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.3374
Minimum 0
Maximum 13
Zeros (%) 41.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 1
Q3 2
95-th percentile 4
Maximum 13
Range 13
Interquartile range 2

Descriptive statistics

Standard deviation 1.471
Coef of variation 1.0999
Kurtosis 1.4231
Mean 1.3374
MAD 1.2307
Skewness 1.0948
Sum 21453
Variance 2.1637
Memory size 125.4 KiB
Value Count Frequency (%)  
0 6729 41.9%
 
2 3499 21.8%
 
1 2583 16.1%
 
3 1802 11.2%
 
4 944 5.9%
 
5 326 2.0%
 
6 100 0.6%
 
7 28 0.2%
 
8 19 0.1%
 
10 4 0.0%
 
Other values (4) 7 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6729 41.9%
 
1 2583 16.1%
 
2 3499 21.8%
 
3 1802 11.2%
 
4 944 5.9%
 

Maximum 5 values

Value Count Frequency (%)  
9 4 0.0%
 
10 4 0.0%
 
11 1 0.0%
 
12 1 0.0%
 
13 1 0.0%
 

Customer_Cluster
Constant

This variable is constant and should be ignored for analysis

Constant value 3

DataOnly_Revtype_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.48064
Minimum 0
Maximum 9
Zeros (%) 64.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 9
Range 9
Interquartile range 1

Descriptive statistics

Standard deviation 0.76399
Coef of variation 1.5895
Kurtosis 7.64
Mean 0.48064
MAD 0.61737
Skewness 2.116
Sum 7710
Variance 0.58368
Memory size 125.4 KiB
Value Count Frequency (%)  
0 10302 64.2%
 
1 4243 26.5%
 
2 1161 7.2%
 
3 245 1.5%
 
4 63 0.4%
 
5 17 0.1%
 
8 4 0.0%
 
6 3 0.0%
 
7 2 0.0%
 
9 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10302 64.2%
 
1 4243 26.5%
 
2 1161 7.2%
 
3 245 1.5%
 
4 63 0.4%
 

Maximum 5 values

Value Count Frequency (%)  
5 17 0.1%
 
6 3 0.0%
 
7 2 0.0%
 
8 4 0.0%
 
9 1 0.0%
 

Edge_contract_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.0941
Minimum 0
Maximum 11
Zeros (%) 3.6%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 1
Median 2
Q3 3
95-th percentile 4
Maximum 11
Range 11
Interquartile range 2

Descriptive statistics

Standard deviation 1.0408
Coef of variation 0.49704
Kurtosis 3.0895
Mean 2.0941
MAD 0.71416
Skewness 0.97407
Sum 33591
Variance 1.0834
Memory size 125.4 KiB
Value Count Frequency (%)  
2 7911 49.3%
 
1 3447 21.5%
 
3 2638 16.4%
 
4 1086 6.8%
 
0 579 3.6%
 
5 273 1.7%
 
6 74 0.5%
 
7 21 0.1%
 
8 7 0.0%
 
11 3 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 579 3.6%
 
1 3447 21.5%
 
2 7911 49.3%
 
3 2638 16.4%
 
4 1086 6.8%
 

Maximum 5 values

Value Count Frequency (%)  
7 21 0.1%
 
8 7 0.0%
 
9 1 0.0%
 
10 1 0.0%
 
11 3 0.0%
 

FamTeens_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.87538
Minimum 0
Maximum 10
Zeros (%) 67.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 10
Range 10
Interquartile range 2

Descriptive statistics

Standard deviation 1.4481
Coef of variation 1.6543
Kurtosis 1.9763
Mean 0.87538
MAD 1.1832
Skewness 1.6015
Sum 14042
Variance 2.0971
Memory size 125.4 KiB
Value Count Frequency (%)  
0 10841 67.6%
 
2 2004 12.5%
 
3 1317 8.2%
 
4 751 4.7%
 
1 692 4.3%
 
5 287 1.8%
 
6 111 0.7%
 
7 26 0.2%
 
8 6 0.0%
 
9 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10841 67.6%
 
1 692 4.3%
 
2 2004 12.5%
 
3 1317 8.2%
 
4 751 4.7%
 

Maximum 5 values

Value Count Frequency (%)  
6 111 0.7%
 
7 26 0.2%
 
8 6 0.0%
 
9 4 0.0%
 
10 2 0.0%
 

GoUnlimit_plan_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.3458
Minimum 0
Maximum 11
Zeros (%) 53.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 11
Range 11
Interquartile range 2

Descriptive statistics

Standard deviation 1.6954
Coef of variation 1.2598
Kurtosis 0.39821
Mean 1.3458
MAD 1.4756
Skewness 1.0383
Sum 21588
Variance 2.8745
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8654 53.9%
 
2 2835 17.7%
 
3 1877 11.7%
 
4 1333 8.3%
 
1 545 3.4%
 
5 515 3.2%
 
6 196 1.2%
 
7 48 0.3%
 
8 26 0.2%
 
9 6 0.0%
 
Other values (2) 6 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8654 53.9%
 
1 545 3.4%
 
2 2835 17.7%
 
3 1877 11.7%
 
4 1333 8.3%
 

Maximum 5 values

Value Count Frequency (%)  
7 48 0.3%
 
8 26 0.2%
 
9 6 0.0%
 
10 5 0.0%
 
11 1 0.0%
 

Hispanic_ethnicity_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.8722
Minimum 0
Maximum 12
Zeros (%) 69.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 12
Range 12
Interquartile range 2

Descriptive statistics

Standard deviation 1.5059
Coef of variation 1.7266
Kurtosis 2.8904
Mean 0.8722
MAD 1.2183
Skewness 1.7476
Sum 13991
Variance 2.2678
Memory size 125.4 KiB
Value Count Frequency (%)  
0 11203 69.8%
 
2 1891 11.8%
 
3 1331 8.3%
 
4 754 4.7%
 
1 358 2.2%
 
5 310 1.9%
 
6 126 0.8%
 
7 39 0.2%
 
8 11 0.1%
 
10 8 0.0%
 
Other values (3) 10 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 11203 69.8%
 
1 358 2.2%
 
2 1891 11.8%
 
3 1331 8.3%
 
4 754 4.7%
 

Maximum 5 values

Value Count Frequency (%)  
8 11 0.1%
 
9 8 0.0%
 
10 8 0.0%
 
11 1 0.0%
 
12 1 0.0%
 

IphoneDev_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.5824
Minimum 0
Maximum 9
Zeros (%) 20.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 2
Q3 2
95-th percentile 4
Maximum 9
Range 9
Interquartile range 1

Descriptive statistics

Standard deviation 1.2042
Coef of variation 0.76097
Kurtosis 0.97098
Mean 1.5824
MAD 0.97472
Skewness 0.73408
Sum 25384
Variance 1.4501
Memory size 125.4 KiB
Value Count Frequency (%)  
2 5386 33.6%
 
1 4481 27.9%
 
0 3291 20.5%
 
3 1785 11.1%
 
4 825 5.1%
 
5 195 1.2%
 
6 55 0.3%
 
7 16 0.1%
 
8 4 0.0%
 
9 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 3291 20.5%
 
1 4481 27.9%
 
2 5386 33.6%
 
3 1785 11.1%
 
4 825 5.1%
 

Maximum 5 values

Value Count Frequency (%)  
5 195 1.2%
 
6 55 0.3%
 
7 16 0.1%
 
8 4 0.0%
 
9 3 0.0%
 

LocalRetailer_chnl_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.53095
Minimum 0
Maximum 10
Zeros (%) 74.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.0938
Coef of variation 2.06
Kurtosis 6.5611
Mean 0.53095
MAD 0.79519
Skewness 2.4169
Sum 8517
Variance 1.1963
Memory size 125.4 KiB
Value Count Frequency (%)  
0 12012 74.9%
 
1 1484 9.3%
 
2 1356 8.5%
 
3 691 4.3%
 
4 341 2.1%
 
5 94 0.6%
 
6 40 0.2%
 
7 16 0.1%
 
8 3 0.0%
 
10 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 12012 74.9%
 
1 1484 9.3%
 
2 1356 8.5%
 
3 691 4.3%
 
4 341 2.1%
 

Maximum 5 values

Value Count Frequency (%)  
6 40 0.2%
 
7 16 0.1%
 
8 3 0.0%
 
9 2 0.0%
 
10 2 0.0%
 

MTM_contract_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.748
Minimum 0
Maximum 11
Zeros (%) 10.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 1
Median 2
Q3 2
95-th percentile 4
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 1.1809
Coef of variation 0.67559
Kurtosis 2.8549
Mean 1.748
MAD 0.90215
Skewness 1.1617
Sum 28039
Variance 1.3945
Memory size 125.4 KiB
Value Count Frequency (%)  
1 5839 36.4%
 
2 5438 33.9%
 
3 1791 11.2%
 
0 1641 10.2%
 
4 933 5.8%
 
5 265 1.7%
 
6 83 0.5%
 
7 28 0.2%
 
8 12 0.1%
 
9 7 0.0%
 
Other values (2) 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 1641 10.2%
 
1 5839 36.4%
 
2 5438 33.9%
 
3 1791 11.2%
 
4 933 5.8%
 

Maximum 5 values

Value Count Frequency (%)  
7 28 0.2%
 
8 12 0.1%
 
9 7 0.0%
 
10 3 0.0%
 
11 1 0.0%
 

MatureAdults_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.44249
Minimum 0
Maximum 13
Zeros (%) 84.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 3
Maximum 13
Range 13
Interquartile range 0

Descriptive statistics

Standard deviation 1.1598
Coef of variation 2.621
Kurtosis 8.7272
Mean 0.44249
MAD 0.75175
Skewness 2.8547
Sum 7098
Variance 1.3451
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13626 84.9%
 
2 877 5.5%
 
3 713 4.4%
 
4 415 2.6%
 
5 163 1.0%
 
1 162 1.0%
 
6 54 0.3%
 
7 16 0.1%
 
8 8 0.0%
 
9 5 0.0%
 
Other values (2) 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13626 84.9%
 
1 162 1.0%
 
2 877 5.5%
 
3 713 4.4%
 
4 415 2.6%
 

Maximum 5 values

Value Count Frequency (%)  
7 16 0.1%
 
8 8 0.0%
 
9 5 0.0%
 
10 1 0.0%
 
13 1 0.0%
 

NewVZPlan_plan_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.89639
Minimum 0
Maximum 10
Zeros (%) 61.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 10
Range 10
Interquartile range 2

Descriptive statistics

Standard deviation 1.3224
Coef of variation 1.4752
Kurtosis 1.5503
Mean 0.89639
MAD 1.0966
Skewness 1.4186
Sum 14379
Variance 1.7486
Memory size 125.4 KiB
Value Count Frequency (%)  
0 9812 61.2%
 
2 2568 16.0%
 
1 1477 9.2%
 
3 1339 8.3%
 
4 581 3.6%
 
5 187 1.2%
 
6 59 0.4%
 
7 12 0.1%
 
8 3 0.0%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 9812 61.2%
 
1 1477 9.2%
 
2 2568 16.0%
 
3 1339 8.3%
 
4 581 3.6%
 

Maximum 5 values

Value Count Frequency (%)  
6 59 0.4%
 
7 12 0.1%
 
8 3 0.0%
 
9 2 0.0%
 
10 1 0.0%
 

Purch_equip_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.74378
Minimum 0
Maximum 10
Zeros (%) 54.6%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 3
Maximum 10
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.0655
Coef of variation 1.4325
Kurtosis 5.3754
Mean 0.74378
MAD 0.81162
Skewness 1.9751
Sum 11931
Variance 1.1352
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8752 54.6%
 
1 4451 27.7%
 
2 1753 10.9%
 
3 639 4.0%
 
4 261 1.6%
 
5 127 0.8%
 
6 40 0.2%
 
7 12 0.1%
 
10 2 0.0%
 
9 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8752 54.6%
 
1 4451 27.7%
 
2 1753 10.9%
 
3 639 4.0%
 
4 261 1.6%
 

Maximum 5 values

Value Count Frequency (%)  
6 40 0.2%
 
7 12 0.1%
 
8 2 0.0%
 
9 2 0.0%
 
10 2 0.0%
 

SingleLine_plan_cnt
Numeric

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.034287
Minimum 0
Maximum 10
Zeros (%) 97.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.21955
Coef of variation 6.4032
Kurtosis 315.81
Mean 0.034287
MAD 0.066535
Skewness 11.783
Sum 550
Variance 0.048201
Memory size 125.4 KiB
Value Count Frequency (%)  
0 15564 97.0%
 
1 422 2.6%
 
2 46 0.3%
 
3 6 0.0%
 
4 2 0.0%
 
10 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 15564 97.0%
 
1 422 2.6%
 
2 46 0.3%
 
3 6 0.0%
 
4 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1 422 2.6%
 
2 46 0.3%
 
3 6 0.0%
 
4 2 0.0%
 
10 1 0.0%
 

SngleCpleKids_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.2398
Minimum 0
Maximum 12
Zeros (%) 55.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 12
Range 12
Interquartile range 2

Descriptive statistics

Standard deviation 1.6072
Coef of variation 1.2963
Kurtosis 0.99571
Mean 1.2398
MAD 1.3868
Skewness 1.1624
Sum 19888
Variance 2.5831
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8844 55.1%
 
2 3049 19.0%
 
3 1815 11.3%
 
4 1051 6.6%
 
1 659 4.1%
 
5 392 2.4%
 
6 153 1.0%
 
7 46 0.3%
 
8 17 0.1%
 
9 8 0.0%
 
Other values (3) 7 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 8844 55.1%
 
1 659 4.1%
 
2 3049 19.0%
 
3 1815 11.3%
 
4 1051 6.6%
 

Maximum 5 values

Value Count Frequency (%)  
8 17 0.1%
 
9 8 0.0%
 
10 4 0.0%
 
11 2 0.0%
 
12 1 0.0%
 

Sprint_port_cnt
Numeric

Distinct count 9
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.31731
Minimum 0
Maximum 8
Zeros (%) 82.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 8
Range 8
Interquartile range 0

Descriptive statistics

Standard deviation 0.78192
Coef of variation 2.4642
Kurtosis 9.1667
Mean 0.31731
MAD 0.52084
Skewness 2.8735
Sum 5090
Variance 0.6114
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13165 82.1%
 
1 1343 8.4%
 
2 1067 6.7%
 
3 293 1.8%
 
4 140 0.9%
 
5 28 0.2%
 
7 2 0.0%
 
6 2 0.0%
 
8 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13165 82.1%
 
1 1343 8.4%
 
2 1067 6.7%
 
3 293 1.8%
 
4 140 0.9%
 

Maximum 5 values

Value Count Frequency (%)  
4 140 0.9%
 
5 28 0.2%
 
6 2 0.0%
 
7 2 0.0%
 
8 1 0.0%
 

TX_state_cd
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.88673
Minimum 0
Maximum 11
Zeros (%) 68.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 11
Range 11
Interquartile range 2

Descriptive statistics

Standard deviation 1.4911
Coef of variation 1.6816
Kurtosis 2.669
Mean 0.88673
MAD 1.2157
Skewness 1.6987
Sum 14224
Variance 2.2234
Memory size 125.4 KiB
Value Count Frequency (%)  
0 10996 68.5%
 
2 2116 13.2%
 
3 1347 8.4%
 
4 699 4.4%
 
1 394 2.5%
 
5 298 1.9%
 
6 119 0.7%
 
7 39 0.2%
 
8 19 0.1%
 
9 9 0.1%
 
Other values (2) 5 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 10996 68.5%
 
1 394 2.5%
 
2 2116 13.2%
 
3 1347 8.4%
 
4 699 4.4%
 

Maximum 5 values

Value Count Frequency (%)  
7 39 0.2%
 
8 19 0.1%
 
9 9 0.1%
 
10 4 0.0%
 
11 1 0.0%
 

T_Mobile_port_cnt
Numeric

Distinct count 10
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.30391
Minimum 0
Maximum 10
Zeros (%) 83.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 10
Range 10
Interquartile range 0

Descriptive statistics

Standard deviation 0.79421
Coef of variation 2.6133
Kurtosis 12.156
Mean 0.30391
MAD 0.50684
Skewness 3.1829
Sum 4875
Variance 0.63076
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13376 83.4%
 
1 1235 7.7%
 
2 919 5.7%
 
3 304 1.9%
 
4 168 1.0%
 
5 25 0.2%
 
6 10 0.1%
 
7 2 0.0%
 
10 1 0.0%
 
9 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13376 83.4%
 
1 1235 7.7%
 
2 919 5.7%
 
3 304 1.9%
 
4 168 1.0%
 

Maximum 5 values

Value Count Frequency (%)  
5 25 0.2%
 
6 10 0.1%
 
7 2 0.0%
 
9 1 0.0%
 
10 1 0.0%
 

TwoYear_contract_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.32087
Minimum 0
Maximum 7
Zeros (%) 74.7%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 1
95-th percentile 2
Maximum 7
Range 7
Interquartile range 1

Descriptive statistics

Standard deviation 0.62848
Coef of variation 1.9587
Kurtosis 9.2638
Mean 0.32087
MAD 0.47915
Skewness 2.5004
Sum 5147
Variance 0.39498
Memory size 125.4 KiB
Value Count Frequency (%)  
0 11977 74.7%
 
1 3215 20.0%
 
2 684 4.3%
 
3 120 0.7%
 
4 29 0.2%
 
5 10 0.1%
 
6 4 0.0%
 
7 2 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 11977 74.7%
 
1 3215 20.0%
 
2 684 4.3%
 
3 120 0.7%
 
4 29 0.2%
 

Maximum 5 values

Value Count Frequency (%)  
3 120 0.7%
 
4 29 0.2%
 
5 10 0.1%
 
6 4 0.0%
 
7 2 0.0%
 

VerizonUNL_plan_cnt
Numeric

Distinct count 13
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.0565
Minimum 0
Maximum 12
Zeros (%) 54.2%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 2
95-th percentile 4
Maximum 12
Range 12
Interquartile range 2

Descriptive statistics

Standard deviation 1.4359
Coef of variation 1.359
Kurtosis 3.0993
Mean 1.0565
MAD 1.1598
Skewness 1.5501
Sum 16948
Variance 2.0617
Memory size 125.4 KiB
Value Count Frequency (%)  
0 8697 54.2%
 
2 2828 17.6%
 
1 2012 12.5%
 
3 1484 9.3%
 
4 607 3.8%
 
5 231 1.4%
 
6 104 0.6%
 
7 41 0.3%
 
9 15 0.1%
 
8 13 0.1%
 
Other values (3) 9 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 8697 54.2%
 
1 2012 12.5%
 
2 2828 17.6%
 
3 1484 9.3%
 
4 607 3.8%
 

Maximum 5 values

Value Count Frequency (%)  
8 13 0.1%
 
9 15 0.1%
 
10 6 0.0%
 
11 1 0.0%
 
12 2 0.0%
 

VoiceOnly_pplangrp_cnt
Highly correlated

This variable is highly correlated with SingleLine_plan_cnt and should be ignored for analysis

Correlation 0.92297

WearablDev_cnt
Numeric

Distinct count 8
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.19824
Minimum 0
Maximum 7
Zeros (%) 84.0%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 7
Range 7
Interquartile range 0

Descriptive statistics

Standard deviation 0.50085
Coef of variation 2.5264
Kurtosis 11.956
Mean 0.19824
MAD 0.33316
Skewness 3.0056
Sum 3180
Variance 0.25085
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13479 84.0%
 
1 2035 12.7%
 
2 454 2.8%
 
3 61 0.4%
 
4 9 0.1%
 
7 1 0.0%
 
6 1 0.0%
 
5 1 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13479 84.0%
 
1 2035 12.7%
 
2 454 2.8%
 
3 61 0.4%
 
4 9 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
3 61 0.4%
 
4 9 0.1%
 
5 1 0.0%
 
6 1 0.0%
 
7 1 0.0%
 

access_rev_acrtn
Numeric

Distinct count 1672
Unique (%) 10.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -0.89539
Minimum -160.6
Maximum 26.36
Zeros (%) 35.1%

Quantile statistics

Minimum -160.6
5-th percentile -5.62
Q1 -1.9
Median 0
Q3 0
95-th percentile 3.35
Maximum 26.36
Range 186.96
Interquartile range 1.9

Descriptive statistics

Standard deviation 3.3545
Coef of variation -3.7464
Kurtosis 340.17
Mean -0.89539
MAD 1.9029
Skewness -8.419
Sum -14363
Variance 11.253
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 5631 35.1%
 
-2.5 122 0.8%
 
-1.67 87 0.5%
 
-2.41 86 0.5%
 
-1.12 79 0.5%
 
-2.17 78 0.5%
 
-2.32 73 0.5%
 
-1.08 73 0.5%
 
-1.16 72 0.4%
 
-2.6 70 0.4%
 
Other values (1662) 9670 60.3%
 

Minimum 5 values

Value Count Frequency (%)  
-160.6 1 0.0%
 
-57.65 1 0.0%
 
-54.44 1 0.0%
 
-41.07 1 0.0%
 
-39.61 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
16.15 1 0.0%
 
19.29 1 0.0%
 
20.37 1 0.0%
 
20.59 1 0.0%
 
26.36 1 0.0%
 

active_loan_cnt
Numeric

Distinct count 44
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 6.6604
Minimum 0
Maximum 121
Zeros (%) 3.5%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 4
Median 4
Q3 9
95-th percentile 16
Maximum 121
Range 121
Interquartile range 5

Descriptive statistics

Standard deviation 6.4232
Coef of variation 0.96438
Kurtosis 27.476
Mean 6.6604
MAD 4.2486
Skewness 3.6466
Sum 106840
Variance 41.257
Memory size 125.4 KiB
Value Count Frequency (%)  
4 5201 32.4%
 
6 2112 13.2%
 
9 1512 9.4%
 
2 1335 8.3%
 
1 1135 7.1%
 
12 799 5.0%
 
16 702 4.4%
 
8 699 4.4%
 
3 632 3.9%
 
0 562 3.5%
 
Other values (34) 1352 8.4%
 

Minimum 5 values

Value Count Frequency (%)  
0 562 3.5%
 
1 1135 7.1%
 
2 1335 8.3%
 
3 632 3.9%
 
4 5201 32.4%
 

Maximum 5 values

Value Count Frequency (%)  
80 2 0.0%
 
90 1 0.0%
 
93 1 0.0%
 
99 1 0.0%
 
121 1 0.0%
 

arpu_amt
Numeric

Distinct count 10222
Unique (%) 63.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 208.3
Minimum 74.23
Maximum 1961.2
Zeros (%) 0.0%

Quantile statistics

Minimum 74.23
5-th percentile 141.38
Q1 170.6
Median 201.68
Q3 234.6
95-th percentile 291.92
Maximum 1961.2
Range 1887
Interquartile range 64

Descriptive statistics

Standard deviation 61.317
Coef of variation 0.29437
Kurtosis 139.88
Mean 208.3
MAD 39.136
Skewness 7.3373
Sum 3341300
Variance 3759.8
Memory size 125.4 KiB
Value Count Frequency (%)  
182.77 8 0.0%
 
189.33 7 0.0%
 
203.67 7 0.0%
 
181.62 7 0.0%
 
225.31 6 0.0%
 
146.43 6 0.0%
 
200.55 6 0.0%
 
195.91 6 0.0%
 
190.17 6 0.0%
 
221.06 6 0.0%
 
Other values (10212) 15976 99.6%
 

Minimum 5 values

Value Count Frequency (%)  
74.23 1 0.0%
 
95.55 1 0.0%
 
96.76 1 0.0%
 
126.34 1 0.0%
 
126.39 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1440.7 1 0.0%
 
1560.64 1 0.0%
 
1615.23 1 0.0%
 
1651.65 1 0.0%
 
1961.24 1 0.0%
 

billed_acct_size
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.8307
Minimum 0
Maximum 11
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1
Q1 2
Median 3
Q3 3
95-th percentile 5
Maximum 11
Range 11
Interquartile range 1

Descriptive statistics

Standard deviation 1.2227
Coef of variation 0.43195
Kurtosis 2.3749
Mean 2.8307
MAD 0.94838
Skewness 1.1885
Sum 45407
Variance 1.495
Memory size 125.4 KiB
Value Count Frequency (%)  
2 6577 41.0%
 
3 4342 27.1%
 
4 2501 15.6%
 
1 1166 7.3%
 
5 931 5.8%
 
6 352 2.2%
 
7 103 0.6%
 
8 39 0.2%
 
9 14 0.1%
 
10 12 0.1%
 
Other values (2) 4 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 3 0.0%
 
1 1166 7.3%
 
2 6577 41.0%
 
3 4342 27.1%
 
4 2501 15.6%
 

Maximum 5 values

Value Count Frequency (%)  
7 103 0.6%
 
8 39 0.2%
 
9 14 0.1%
 
10 12 0.1%
 
11 1 0.0%
 

birth_year
Numeric

Distinct count 84
Unique (%) 0.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1977.3
Minimum 0
Maximum 2000
Zeros (%) 0.0%

Quantile statistics

Minimum 0
5-th percentile 1951
Q1 1969
Median 1981
Q3 1989
95-th percentile 1996
Maximum 2000
Range 2000
Interquartile range 20

Descriptive statistics

Standard deviation 40.787
Coef of variation 0.020627
Kurtosis 2064.1
Mean 1977.3
MAD 12.575
Skewness -42.651
Sum 31718279
Variance 1663.5
Memory size 125.4 KiB
Value Count Frequency (%)  
1989 574 3.6%
 
1990 572 3.6%
 
1985 535 3.3%
 
1991 535 3.3%
 
1988 515 3.2%
 
1992 510 3.2%
 
1986 503 3.1%
 
1993 487 3.0%
 
1987 486 3.0%
 
1994 444 2.8%
 
Other values (74) 10880 67.8%
 

Minimum 5 values

Value Count Frequency (%)  
0 6 0.0%
 
1910 1 0.0%
 
1911 2 0.0%
 
1917 1 0.0%
 
1919 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
1996 331 2.1%
 
1997 306 1.9%
 
1998 238 1.5%
 
1999 149 0.9%
 
2000 1 0.0%
 

cust_status_ind
Boolean

Distinct count 2
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Mean 0.91291
1
14644
0
 
1397
Value Count Frequency (%)  
1 14644 91.3%
 
0 1397 8.7%
 

data_overage_amt
Numeric

Distinct count 583
Unique (%) 3.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.83349
Minimum 0
Maximum 216.67
Zeros (%) 83.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 4.29
Maximum 216.67
Range 216.67
Interquartile range 0

Descriptive statistics

Standard deviation 4.5017
Coef of variation 5.401
Kurtosis 563.04
Mean 0.83349
MAD 1.4102
Skewness 18.145
Sum 13370
Variance 20.265
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 13332 83.1%
 
0.48 89 0.6%
 
0.56 58 0.4%
 
0.52 53 0.3%
 
0.5 51 0.3%
 
0.6 51 0.3%
 
0.54 49 0.3%
 
0.58 48 0.3%
 
2.5 44 0.3%
 
0.62 42 0.3%
 
Other values (573) 2224 13.9%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 13332 83.1%
 
0.01 12 0.1%
 
0.02 2 0.0%
 
0.03 3 0.0%
 
0.04 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
108.12 1 0.0%
 
120.0 1 0.0%
 
127.17 1 0.0%
 
150.6 1 0.0%
 
216.67 1 0.0%
 

data_ovrg_Yes_cnt
Numeric

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.21931
Minimum 0
Maximum 4
Zeros (%) 82.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.52378
Coef of variation 2.3883
Kurtosis 5.9749
Mean 0.21931
MAD 0.36359
Skewness 2.4957
Sum 3518
Variance 0.27434
Memory size 125.4 KiB
Value Count Frequency (%)  
0 13297 82.9%
 
1 2020 12.6%
 
2 677 4.2%
 
3 44 0.3%
 
4 3 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 13297 82.9%
 
1 2020 12.6%
 
2 677 4.2%
 
3 44 0.3%
 
4 3 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
0 13297 82.9%
 
1 2020 12.6%
 
2 677 4.2%
 
3 44 0.3%
 
4 3 0.0%
 

eqp_rev_acrtn
Numeric

Distinct count 2127
Unique (%) 13.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -1.5972
Minimum -632.79
Maximum 321.07
Zeros (%) 7.3%

Quantile statistics

Minimum -632.79
5-th percentile -8.92
Q1 -2.08
Median -0.42
Q3 0.32
95-th percentile 2.2
Maximum 321.07
Range 953.86
Interquartile range 2.4

Descriptive statistics

Standard deviation 8.9782
Coef of variation -5.6212
Kurtosis 1920
Mean -1.5972
MAD 2.8281
Skewness -27.681
Sum -25621
Variance 80.609
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 1165 7.3%
 
-0.01 736 4.6%
 
-0.02 230 1.4%
 
-1.47 77 0.5%
 
-1.46 72 0.4%
 
-1.26 67 0.4%
 
-1.22 63 0.4%
 
0.07 61 0.4%
 
-1.3 51 0.3%
 
-1.04 51 0.3%
 
Other values (2117) 13468 84.0%
 

Minimum 5 values

Value Count Frequency (%)  
-632.79 1 0.0%
 
-336.5 1 0.0%
 
-331.46 1 0.0%
 
-261.22 1 0.0%
 
-148.3 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
41.77 1 0.0%
 
45.1 1 0.0%
 
65.61 1 0.0%
 
141.23 1 0.0%
 
321.07 1 0.0%
 

frequency
Numeric

Distinct count 34
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 24.871
Minimum 3
Maximum 54
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 19
Q1 22
Median 25
Q3 28
95-th percentile 31
Maximum 54
Range 51
Interquartile range 6

Descriptive statistics

Standard deviation 4.1412
Coef of variation 0.16651
Kurtosis 0.85829
Mean 24.871
MAD 3.4284
Skewness -0.49876
Sum 398956
Variance 17.149
Memory size 125.4 KiB
Value Count Frequency (%)  
20 1528 9.5%
 
21 1480 9.2%
 
30 1424 8.9%
 
29 1368 8.5%
 
25 1218 7.6%
 
28 1216 7.6%
 
22 1194 7.4%
 
26 1180 7.4%
 
27 1177 7.3%
 
23 1163 7.3%
 
Other values (24) 3093 19.3%
 

Minimum 5 values

Value Count Frequency (%)  
3 4 0.0%
 
4 6 0.0%
 
5 6 0.0%
 
6 5 0.0%
 
7 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
34 1 0.0%
 
44 1 0.0%
 
49 1 0.0%
 
51 1 0.0%
 
54 1 0.0%
 

gl_late_fees_amt
Numeric

Distinct count 2336
Unique (%) 14.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 34.746
Minimum -5
Maximum 190.84
Zeros (%) 29.8%

Quantile statistics

Minimum -5
5-th percentile 0
Q1 0
Median 15
Q3 65
95-th percentile 115
Maximum 190.84
Range 195.84
Interquartile range 65

Descriptive statistics

Standard deviation 40.407
Coef of variation 1.1629
Kurtosis -0.43165
Mean 34.746
MAD 34.857
Skewness 0.93456
Sum 557360
Variance 1632.7
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 4780 29.8%
 
5.0 2060 12.8%
 
10.0 911 5.7%
 
15.0 573 3.6%
 
20.0 454 2.8%
 
25.0 355 2.2%
 
30.0 318 2.0%
 
45.0 283 1.8%
 
35.0 278 1.7%
 
40.0 242 1.5%
 
Other values (2326) 5787 36.1%
 

Minimum 5 values

Value Count Frequency (%)  
-5.0 3 0.0%
 
0.0 4780 29.8%
 
0.15 1 0.0%
 
0.35 2 0.0%
 
0.54 2 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
166.93 1 0.0%
 
174.03 1 0.0%
 
175.78 1 0.0%
 
182.99 1 0.0%
 
190.84 1 0.0%
 

gl_nonrecur_rev_amt
Numeric

Distinct count 7754
Unique (%) 48.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 125.9
Minimum -2003.3
Maximum 5005.7
Zeros (%) 11.1%

Quantile statistics

Minimum -2003.3
5-th percentile 0
Q1 6.99
Median 55
Q3 160
95-th percentile 517.26
Maximum 5005.7
Range 7009
Interquartile range 153.01

Descriptive statistics

Standard deviation 223.47
Coef of variation 1.775
Kurtosis 44.729
Mean 125.9
MAD 135.61
Skewness 4.4649
Sum 2019600
Variance 49939
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 1775 11.1%
 
5.0 647 4.0%
 
10.0 283 1.8%
 
15.0 265 1.7%
 
20.0 193 1.2%
 
30.0 182 1.1%
 
1.99 176 1.1%
 
35.0 140 0.9%
 
25.0 140 0.9%
 
45.0 118 0.7%
 
Other values (7744) 12122 75.6%
 

Minimum 5 values

Value Count Frequency (%)  
-2003.31 1 0.0%
 
-1085.52 1 0.0%
 
-1083.48 1 0.0%
 
-681.54 1 0.0%
 
-641.83 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
3025.23 1 0.0%
 
3146.06 1 0.0%
 
3719.28 1 0.0%
 
3770.0 1 0.0%
 
5005.71 1 0.0%
 

handset_paychnl_cnt
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.5335
Minimum 0
Maximum 11
Zeros (%) 40.1%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 2
Q3 3
95-th percentile 4
Maximum 11
Range 11
Interquartile range 3

Descriptive statistics

Standard deviation 1.5536
Coef of variation 1.0131
Kurtosis 0.63202
Mean 1.5335
MAD 1.3184
Skewness 0.83318
Sum 24599
Variance 2.4137
Memory size 125.4 KiB
Value Count Frequency (%)  
0 6425 40.1%
 
2 4226 26.3%
 
3 2304 14.4%
 
1 1352 8.4%
 
4 1142 7.1%
 
5 377 2.4%
 
6 134 0.8%
 
7 48 0.3%
 
8 16 0.1%
 
9 9 0.1%
 
Other values (2) 8 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
0 6425 40.1%
 
1 1352 8.4%
 
2 4226 26.3%
 
3 2304 14.4%
 
4 1142 7.1%
 

Maximum 5 values

Value Count Frequency (%)  
7 48 0.3%
 
8 16 0.1%
 
9 9 0.1%
 
10 7 0.0%
 
11 1 0.0%
 

index
Numeric

Distinct count 16041
Unique (%) 100.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 380260
Minimum 14
Maximum 758148
Zeros (%) 0.0%

Quantile statistics

Minimum 14
5-th percentile 36635
Q1 191500
Median 380340
Q3 571180
95-th percentile 719930
Maximum 758148
Range 758134
Interquartile range 379690

Descriptive statistics

Standard deviation 219550
Coef of variation 0.57737
Kurtosis -1.205
Mean 380260
MAD 190310
Skewness -0.013489
Sum 6099718136
Variance 48203000000
Memory size 125.4 KiB
Value Count Frequency (%)  
1 441414 2751.8%
 
1 283455 1767.1%
 
1 133972 835.2%
 
1 577659 3601.1%
 
1 9041 56.4%
 
1 207696 1294.8%
 
1 219982 1371.4%
 
1 377677 2354.4%
 
1 281420 1754.4%
 
1 451401 2814.0%
 
Other values (16031) 6096734419 38007196.7%
 

Minimum 5 values

Value Count Frequency (%)  
1 441414 2751.8%
 
1 283455 1767.1%
 
1 133972 835.2%
 
1 577659 3601.1%
 
1 9041 56.4%
 

Maximum 5 values

Value Count Frequency (%)  
1 740755 4617.9%
 
1 17810 111.0%
 
1 155025 966.4%
 
1 601487 3749.7%
 
1 387633 2416.5%
 

monetary
Numeric

Distinct count 15507
Unique (%) 96.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4999.9
Minimum 3899.9
Maximum 6401.2
Zeros (%) 0.0%

Quantile statistics

Minimum 3899.9
5-th percentile 3989
Q1 4357.8
Median 4935
Q3 5597
95-th percentile 6219.1
Maximum 6401.2
Range 2501.3
Interquartile range 1239.2

Descriptive statistics

Standard deviation 720.27
Coef of variation 0.14406
Kurtosis -1.1492
Mean 4999.9
MAD 622.71
Skewness 0.24149
Sum 80203000
Variance 518790
Memory size 125.4 KiB
Value Count Frequency (%)  
5039.99 3 0.0%
 
4002.71 3 0.0%
 
3933.37 3 0.0%
 
4576.59 3 0.0%
 
6108.99 3 0.0%
 
4978.73 3 0.0%
 
5708.36 3 0.0%
 
5245.14 3 0.0%
 
4059.72 3 0.0%
 
5060.52 2 0.0%
 
Other values (15497) 16012 99.8%
 

Minimum 5 values

Value Count Frequency (%)  
3899.91 1 0.0%
 
3903.62 1 0.0%
 
3903.77 1 0.0%
 
3908.84 1 0.0%
 
3909.66 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
6399.24 1 0.0%
 
6399.35 1 0.0%
 
6399.39 1 0.0%
 
6401.15 1 0.0%
 
6401.19 1 0.0%
 

pplan_access_acrtn
Numeric

Distinct count 746
Unique (%) 4.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4.8263
Minimum -4
Maximum 946.67
Zeros (%) 45.6%

Quantile statistics

Minimum -4
5-th percentile 0
Q1 0
Median 1.33
Q3 7.2
95-th percentile 18.33
Maximum 946.67
Range 950.67
Interquartile range 7.2

Descriptive statistics

Standard deviation 11.552
Coef of variation 2.3935
Kurtosis 2783
Mean 4.8263
MAD 5.3003
Skewness 36.691
Sum 77419
Variance 133.44
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 7318 45.6%
 
2.17 209 1.3%
 
2.24 208 1.3%
 
8.8 172 1.1%
 
5.0 171 1.1%
 
8.46 171 1.1%
 
2.41 134 0.8%
 
2.32 132 0.8%
 
8.15 132 0.8%
 
4.48 131 0.8%
 
Other values (736) 7263 45.3%
 

Minimum 5 values

Value Count Frequency (%)  
-4.0 1 0.0%
 
-3.93 1 0.0%
 
-3.57 1 0.0%
 
-2.71 1 0.0%
 
-2.5 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
165.0 1 0.0%
 
183.33 1 0.0%
 
192.5 1 0.0%
 
210.0 1 0.0%
 
946.67 1 0.0%
 

recency
Numeric

Distinct count 27
Unique (%) 0.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 71.98
Minimum 51
Maximum 931
Zeros (%) 0.0%

Quantile statistics

Minimum 51
5-th percentile 51
Q1 51
Median 51
Q3 51
95-th percentile 232
Maximum 931
Range 880
Interquartile range 0

Descriptive statistics

Standard deviation 76.021
Coef of variation 1.0561
Kurtosis 21.897
Mean 71.98
MAD 37.702
Skewness 4.4083
Sum 1154638
Variance 5779.3
Memory size 125.4 KiB
Value Count Frequency (%)  
51 14413 89.9%
 
232 163 1.0%
 
112 162 1.0%
 
173 146 0.9%
 
201 145 0.9%
 
263 141 0.9%
 
81 137 0.9%
 
142 134 0.8%
 
293 109 0.7%
 
354 85 0.5%
 
Other values (17) 406 2.5%
 

Minimum 5 values

Value Count Frequency (%)  
51 14413 89.9%
 
81 137 0.9%
 
112 162 1.0%
 
142 134 0.8%
 
173 146 0.9%
 

Maximum 5 values

Value Count Frequency (%)  
719 6 0.0%
 
750 3 0.0%
 
781 1 0.0%
 
872 1 0.0%
 
931 1 0.0%
 

recurring_paychnl_cnt
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.0046
Minimum 0
Maximum 10
Zeros (%) 27.5%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 2
Q3 3
95-th percentile 5
Maximum 10
Range 10
Interquartile range 3

Descriptive statistics

Standard deviation 1.6164
Coef of variation 0.80633
Kurtosis 0.071252
Mean 2.0046
MAD 1.2431
Skewness 0.49083
Sum 32156
Variance 2.6127
Memory size 125.4 KiB
Value Count Frequency (%)  
2 4797 29.9%
 
0 4406 27.5%
 
3 3007 18.7%
 
4 1693 10.6%
 
1 1111 6.9%
 
5 666 4.2%
 
6 257 1.6%
 
7 67 0.4%
 
8 20 0.1%
 
9 13 0.1%
 

Minimum 5 values

Value Count Frequency (%)  
0 4406 27.5%
 
1 1111 6.9%
 
2 4797 29.9%
 
3 3007 18.7%
 
4 1693 10.6%
 

Maximum 5 values

Value Count Frequency (%)  
6 257 1.6%
 
7 67 0.4%
 
8 20 0.1%
 
9 13 0.1%
 
10 4 0.0%
 

tot_revenue_acrtn
Numeric

Distinct count 3546
Unique (%) 22.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -7.856
Minimum -727.62
Maximum 602.86
Zeros (%) 0.1%

Quantile statistics

Minimum -727.62
5-th percentile -26.05
Q1 -10.03
Median -5.14
Q3 -2.24
95-th percentile 1.51
Maximum 602.86
Range 1330.5
Interquartile range 7.79

Descriptive statistics

Standard deviation 15.397
Coef of variation -1.9599
Kurtosis 629.85
Mean -7.856
MAD 6.9519
Skewness -4.7067
Sum -126020
Variance 237.06
Memory size 125.4 KiB
Value Count Frequency (%)  
-2.48 28 0.2%
 
-3.24 25 0.2%
 
-3.76 24 0.1%
 
-1.86 23 0.1%
 
-4.48 23 0.1%
 
-4.23 23 0.1%
 
-4.68 22 0.1%
 
-3.09 22 0.1%
 
-2.05 21 0.1%
 
-4.25 21 0.1%
 
Other values (3536) 15809 98.6%
 

Minimum 5 values

Value Count Frequency (%)  
-727.62 1 0.0%
 
-412.54 1 0.0%
 
-403.97 1 0.0%
 
-321.79 1 0.0%
 
-286.51 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
80.4 1 0.0%
 
133.69 1 0.0%
 
138.29 1 0.0%
 
526.49 1 0.0%
 
602.86 1 0.0%
 

usage_rev_acrtn
Numeric

Distinct count 486
Unique (%) 3.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.043702
Minimum -261.68
Maximum 57.5
Zeros (%) 87.7%

Quantile statistics

Minimum -261.68
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0.37
Maximum 57.5
Range 319.18
Interquartile range 0

Descriptive statistics

Standard deviation 2.42
Coef of variation 55.375
Kurtosis 8596.9
Mean 0.043702
MAD 0.20351
Skewness -78.958
Sum 701.02
Variance 5.8563
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 14072 87.7%
 
0.07 77 0.5%
 
0.09 74 0.5%
 
0.08 55 0.3%
 
0.1 42 0.3%
 
0.34 34 0.2%
 
0.5 33 0.2%
 
0.48 32 0.2%
 
0.01 30 0.2%
 
0.4 27 0.2%
 
Other values (476) 1565 9.8%
 

Minimum 5 values

Value Count Frequency (%)  
-261.68 1 0.0%
 
-60.71 1 0.0%
 
-46.45 1 0.0%
 
-33.27 1 0.0%
 
-22.87 1 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
26.14 1 0.0%
 
28.0 1 0.0%
 
31.82 1 0.0%
 
40.46 1 0.0%
 
57.5 1 0.0%
 

voice_overage_amt
Numeric

Distinct count 598
Unique (%) 3.7%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.50499
Minimum 0
Maximum 1031
Zeros (%) 88.4%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 1.02
Maximum 1031
Range 1031
Interquartile range 0

Descriptive statistics

Standard deviation 11.22
Coef of variation 22.218
Kurtosis 6802.6
Mean 0.50499
MAD 0.92241
Skewness 79.039
Sum 8100.6
Variance 125.89
Memory size 125.4 KiB
Value Count Frequency (%)  
0.0 14178 88.4%
 
0.07 63 0.4%
 
0.09 46 0.3%
 
0.06 41 0.3%
 
0.12 33 0.2%
 
0.14 33 0.2%
 
0.08 32 0.2%
 
0.16 25 0.2%
 
0.13 24 0.1%
 
0.36 22 0.1%
 
Other values (588) 1544 9.6%
 

Minimum 5 values

Value Count Frequency (%)  
0.0 14178 88.4%
 
0.02 1 0.0%
 
0.03 18 0.1%
 
0.04 20 0.1%
 
0.05 9 0.1%
 

Maximum 5 values

Value Count Frequency (%)  
119.89 1 0.0%
 
153.03 1 0.0%
 
153.94 1 0.0%
 
880.56 1 0.0%
 
1031.04 1 0.0%
 

Correlations

Sample

cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
283582 1 51 21 4451.09 0 0 5 15 1984 4 2 3 5 0 0 0 3 4 0 2 0 5 0 0 0 2 0 4 0 1 1 0 0 0 0 0 211.96 0.0 0.00 0.71 -15.14 9.29 -1.10 0.00 0.00 0.00 3
696787 1 51 26 4092.24 0 1 2 3 1988 0 3 3 2 0 0 0 1 2 0 0 0 0 0 0 0 3 0 2 3 0 2 0 1 0 0 0 157.39 45.0 112.34 0.00 -12.66 -2.50 0.40 0.19 0.00 0.00 3
458097 1 51 26 4208.64 0 4 4 8 1961 0 0 0 4 4 0 0 2 4 2 0 0 0 0 2 4 0 0 0 0 1 1 0 0 0 1 0 161.87 0.0 73.04 16.92 -2.73 -0.55 1.08 0.00 0.00 1.58 3
124254 1 51 22 4813.30 2 0 4 4 1948 0 4 3 4 0 0 0 1 0 1 0 0 0 4 0 0 0 0 0 4 3 3 0 0 0 0 0 218.79 5.0 199.62 0.00 -10.34 -1.59 0.00 0.00 0.00 0.00 3
30788 1 51 31 6013.69 1 1 4 12 1980 0 2 1 4 0 0 1 3 0 1 0 2 0 0 0 0 0 2 0 1 0 1 0 3 0 0 2 193.99 0.0 -83.24 1.94 -8.10 -0.17 -4.87 0.90 2.58 0.00 3

Cluster 3 - Gold Findings:

1. Recency - High (Mean - 70.6)
2. Frequency - High (Mean - 24.9)
3. Monetary - High (Mean - 5003.7)
6. IphoneDev_cnt - 80%
10. ARPU Amount - 208.06 (Mean)
11. billed_acct_size - 2.8353 (Mean)
12. Birth Year - 1977 (Mean)
13. % of AT&T port in - 29.1%
14. % of Sprint_port in - 18%
15. % of T mobile port in - 17%
16. % of Android Devices - 49%
17. CA users - 47.7%
18. TX users - 32%
19. CommunicStore_chnl_cnt - 58%
20. Edge_contract_cnt - 97.4% 
22. SingleLine_plan_cnt - 3%
23. data_ovrg_Yes_cnt - 21%
27. Purch_equip_cnt - 43.5%
29. WearablDev_cnt - 16%
30. active_loan_cnt - 50%
31. Total Revenue Acrtn - -30.913(Mean)
32. MTM_contract_cnt - 90%
In [48]:
df[df['Customer_Cluster']== 3].head(10)
Out[48]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
283582 1 51 21 4451.09 0 0 5 15 1984 4 2 3 5 0 0 0 3 4 0 2 0 5 0 0 0 2 0 4 0 1 1 0 0 0 0 0 211.96 0.00 0.00 0.71 -15.14 9.29 -1.10 0.00 0.00 0.00 3
696787 1 51 26 4092.24 0 1 2 3 1988 0 3 3 2 0 0 0 1 2 0 0 0 0 0 0 0 3 0 2 3 0 2 0 1 0 0 0 157.39 45.00 112.34 0.00 -12.66 -2.50 0.40 0.19 0.00 0.00 3
458097 1 51 26 4208.64 0 4 4 8 1961 0 0 0 4 4 0 0 2 4 2 0 0 0 0 2 4 0 0 0 0 1 1 0 0 0 1 0 161.87 0.00 73.04 16.92 -2.73 -0.55 1.08 0.00 0.00 1.58 3
124254 1 51 22 4813.30 2 0 4 4 1948 0 4 3 4 0 0 0 1 0 1 0 0 0 4 0 0 0 0 0 4 3 3 0 0 0 0 0 218.79 5.00 199.62 0.00 -10.34 -1.59 0.00 0.00 0.00 0.00 3
30788 1 51 31 6013.69 1 1 4 12 1980 0 2 1 4 0 0 1 3 0 1 0 2 0 0 0 0 0 2 0 1 0 1 0 3 0 0 2 193.99 0.00 -83.24 1.94 -8.10 -0.17 -4.87 0.90 2.58 0.00 3
544939 1 51 29 5951.16 0 2 2 4 1992 0 0 2 2 0 0 0 2 0 0 0 0 0 0 1 0 0 0 0 2 0 1 0 0 0 0 0 205.21 90.32 146.31 9.83 -3.61 -2.25 -0.94 0.00 0.00 0.00 3
147130 1 51 26 4159.21 0 1 1 1 1955 0 0 0 0 1 1 0 1 0 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 159.97 85.00 125.00 2.50 -0.85 0.00 0.26 0.00 0.00 0.00 3
582837 1 51 26 6077.17 0 2 3 9 1983 0 2 2 3 3 3 1 3 2 0 0 1 0 0 1 0 0 1 2 3 0 0 0 0 1 1 0 233.74 0.00 20.00 0.58 -4.12 0.38 2.50 -0.77 0.77 0.00 3
366877 0 446 15 4608.89 0 1 2 6 1911 0 2 3 3 0 3 0 2 0 0 0 1 3 0 0 0 2 0 0 3 1 1 0 0 0 0 0 307.26 55.05 102.05 4.00 -19.47 -4.33 0.79 0.00 0.00 0.00 3
59231 1 51 31 4497.94 2 1 2 2 1983 0 0 1 1 2 0 1 1 2 0 0 0 0 0 1 0 0 0 2 0 1 1 0 0 0 1 0 145.09 10.00 41.99 4.68 -2.51 -0.48 0.77 0.00 0.97 0.00 3

Cluster 3 contains average customers with medium monetary values. We can name this cluster as "Gold"

In [49]:
df.columns
Out[49]:
Index(['cust_status_ind', 'recency', 'frequency', 'monetary',
       'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt', 'billed_acct_size',
       'active_loan_cnt', 'birth_year', 'BeyondUnlmtd_plan_cnt',
       'GoUnlimit_plan_cnt', 'IphoneDev_cnt', 'recurring_paychnl_cnt',
       'CA_state_cd', 'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt',
       'Edge_contract_cnt', 'CommunicStore_chnl_cnt', 'AndroidDev_cnt',
       'WearablDev_cnt', 'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt',
       'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt',
       'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt',
       'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt',
       'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt',
       'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt', 'gl_nonrecur_rev_amt',
       'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn',
       'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt',
       'voice_overage_amt', 'Customer_Cluster'],
      dtype='object')

Cluster Naming:

1. Cluster 0 : Silver 
2. Cluster 1 : Platinum
3. Cluster 2 : Bronze
4. Cluster 3 : Gold
In [50]:
# Renaming the clusters in the original dataframe:
df['Customer_Cluster'] = np.select([df.Customer_Cluster == 0, df.Customer_Cluster == 1, df.Customer_Cluster == 2, df.Customer_Cluster == 3], 
                        ['Silver','Platinum','Bronze','Gold'])
In [51]:
df.Customer_Cluster.value_counts()
Out[51]:
Silver      28578
Bronze      19777
Gold        16041
Platinum     9187
Name: Customer_Cluster, dtype: int64
In [52]:
sns.countplot('Customer_Cluster',data=df)
Out[52]:
<matplotlib.axes._subplots.AxesSubplot at 0x1dc1963da90>
In [53]:
df1 = pd.read_csv("fact_bill_ltv_smry_data_0821_Cust.txt", sep = '|')
In [54]:
df1.head()
Out[54]:
Unnamed: 0 525298014 17/03/04 0 719 7 904.34 0.1 1 1.1 1.2 1.3 1.4 1996 75150 1.5 0.2 0.3 1.6 1.7 0.4 0.5 0.6 1.8 1.9 0.7 0.8 0.9 1.10 1.11 0.10 0.11 1.12 0.12 0.13 0.14 0.15 1.13 0.16 0.17 0.18 0.19 0.20 129.19 25.00 646.29 613.33 32.96 9.29 .83 .00 .34 .28 .00.1 .00
0 NaN 925381920 17/07/22 1 51 25 8012.49 5 0 5 5 5 15 1966 12309 3 0 5 3 5 0 0 0 3 5 0 0 1 0 0 2 0 0 1 5 0 2 0 0 0 0 2 0 320.50 5.00 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0
1 NaN 825437339 17/11/01 1 51 21 5977.00 0 0 3 3 3 9 1993 76104 3 0 3 3 3 0 0 0 3 2 0 0 0 3 0 0 0 3 0 0 3 0 1 0 0 0 0 0 284.62 96.97 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0
2 NaN 689017953 17/10/14 1 51 22 5842.94 0 0 4 4 4 16 1971 12601 4 0 4 0 4 0 0 0 4 4 4 0 0 0 0 2 0 0 0 0 4 0 4 0 4 0 0 0 265.59 5.40 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0
3 NaN 373396651 17/11/27 1 51 21 6018.78 0 0 2 2 2 4 1984 95337 2 2 0 2 0 2 0 0 2 0 0 0 0 0 2 0 0 2 0 2 0 0 2 0 0 0 0 0 286.61 0.00 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0
4 NaN 588971563 17/06/19 1 51 26 7135.08 4 0 4 4 4 12 1960 14883 4 0 4 3 4 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 1 274.43 15.00 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0
In [55]:
df1.columns = ['sno','cust_id', 'cust_estb_dt', 'cust_status_ind', 'recency', 'frequency', 'monetary', 'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt', 'billed_acct_size', 'VoiceTextData_pplangrp_cnt', 'VoiceData_revtype_cnt', 'active_loan_cnt', 'birth_year', 'zip5_cd', 'Loand_equip_cnt', 'BeyondUnlmtd_plan_cnt', 'GoUnlimit_plan_cnt', 'IphoneDev_cnt', 'recurring_paychnl_cnt', 'CA_state_cd', 'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt', 'Edge_contract_cnt', 'CommunicStore_chnl_cnt', 'AndroidDev_cnt', 'WearablDev_cnt', 'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt', 'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt', 'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt', 'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt', 'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt', 'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt', 'gl_tot_service_rev_amt', 'gl_recur_rev_amt', 'gl_nonrecur_rev_amt', 'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn', 'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt', 'voice_overage_amt']
df1 = df1.drop('sno', axis=1)
In [56]:
df1 = df1[df1.monetary > -3335.15375]
df1 = df1[df1.monetary < 9999.65625]
In [57]:
df2 = df['Customer_Cluster']
In [58]:
result = pd.concat([df1, df2], axis=1)
In [59]:
result.head()
Out[59]:
cust_id cust_estb_dt cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size VoiceTextData_pplangrp_cnt VoiceData_revtype_cnt active_loan_cnt birth_year zip5_cd Loand_equip_cnt BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_tot_service_rev_amt gl_recur_rev_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
0 925381920 17/07/22 1 51 25 8012.49 5 0 5 5 5 15 1966 12309 3 0 5 3 5 0 0 0 3 5 0 0 1 0 0 2 0 0 1 5 0 2 0 0 0 0 2 0 320.50 5.00 3934.73 3723.20 211.53 0.0 -7.20 6.04 -10.14 0.00 0.0 0.0 NaN
1 825437339 17/11/01 1 51 21 5977.00 0 0 3 3 3 9 1993 76104 3 0 3 3 3 0 0 0 3 2 0 0 0 3 0 0 0 3 0 0 3 0 1 0 0 0 0 0 284.62 96.97 3166.38 2882.41 283.97 0.0 -12.15 0.00 -4.30 0.00 0.0 0.0 NaN
2 689017953 17/10/14 1 51 22 5842.94 0 0 4 4 4 16 1971 12601 4 0 4 0 4 0 0 0 4 4 4 0 0 0 0 2 0 0 0 0 4 0 4 0 4 0 0 0 265.59 5.40 3051.38 3042.00 9.38 0.0 -12.80 0.00 -1.02 0.00 0.0 0.0 NaN
3 373396651 17/11/27 1 51 21 6018.78 0 0 2 2 2 4 1984 95337 2 2 0 2 0 2 0 0 2 0 0 0 0 0 2 0 0 2 0 2 0 0 2 0 0 0 0 0 286.61 0.00 3570.00 3570.00 0.00 0.0 -3.36 0.00 -0.61 0.00 0.0 0.0 NaN
4 588971563 17/06/19 1 51 26 7135.08 4 0 4 4 4 12 1960 14883 4 0 4 3 4 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 1 274.43 15.00 4102.56 4000.00 102.56 0.0 2.40 -1.92 -0.34 0.15 0.0 0.0 NaN
In [60]:
#result.to_csv (r'Final_table.csv', index = None, header=True)

Classification Models

In [61]:
df.head()
Out[61]:
cust_status_ind recency frequency monetary NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt gl_late_fees_amt gl_nonrecur_rev_amt pplan_access_acrtn tot_revenue_acrtn access_rev_acrtn eqp_rev_acrtn usage_rev_acrtn data_overage_amt voice_overage_amt Customer_Cluster
56841 1 51 27 2164.37 1 0 1 1 1952 0 0 1 1 0 0 0 1 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 0 80.16 5.0 6.99 1.30 -1.61 -0.29 0.00 0.0 0.00 0.0 Silver
13719 1 51 28 6943.23 0 4 4 8 1975 0 0 3 0 0 0 0 2 0 2 0 0 4 0 0 0 0 0 4 4 2 4 0 0 0 0 0 247.97 35.0 35.00 15.71 -0.92 0.00 1.49 0.0 0.00 0.0 Platinum
237065 1 51 27 1828.20 2 0 1 2 1932 0 0 0 0 0 0 0 2 2 2 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 67.71 0.0 0.00 1.48 -3.27 0.00 -0.78 0.0 0.00 0.0 Silver
62699 1 51 24 2805.00 1 1 1 1 1991 0 1 1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0 0 0 116.88 5.0 -46.28 1.67 -0.10 -1.04 2.00 0.0 1.25 0.0 Silver
283582 1 51 21 4451.09 0 0 5 15 1984 4 2 3 5 0 0 0 3 4 0 2 0 5 0 0 0 2 0 4 0 1 1 0 0 0 0 0 211.96 0.0 0.00 0.71 -15.14 9.29 -1.10 0.0 0.00 0.0 Gold
In [62]:
df.info()
<class 'pandas.core.frame.DataFrame'>
Int64Index: 73583 entries, 56841 to 491409
Data columns (total 47 columns):
cust_status_ind           73583 non-null int64
recency                   73583 non-null int64
frequency                 73583 non-null int64
monetary                  73583 non-null float64
NewVZPlan_plan_cnt        73583 non-null int64
VerizonUNL_plan_cnt       73583 non-null int64
billed_acct_size          73583 non-null int64
active_loan_cnt           73583 non-null int64
birth_year                73583 non-null int64
BeyondUnlmtd_plan_cnt     73583 non-null int64
GoUnlimit_plan_cnt        73583 non-null int64
IphoneDev_cnt             73583 non-null int64
recurring_paychnl_cnt     73583 non-null int64
CA_state_cd               73583 non-null int64
Hispanic_ethnicity_cnt    73583 non-null int64
data_ovrg_Yes_cnt         73583 non-null int64
Edge_contract_cnt         73583 non-null int64
CommunicStore_chnl_cnt    73583 non-null int64
AndroidDev_cnt            73583 non-null int64
WearablDev_cnt            73583 non-null int64
SingleLine_plan_cnt       73583 non-null int64
TX_state_cd               73583 non-null int64
ATT_port_cnt              73583 non-null int64
DataOnly_Revtype_cnt      73583 non-null int64
MatureAdults_cnt          73583 non-null int64
handset_paychnl_cnt       73583 non-null int64
VoiceOnly_pplangrp_cnt    73583 non-null int64
SngleCpleKids_cnt         73583 non-null int64
FamTeens_cnt              73583 non-null int64
Purch_equip_cnt           73583 non-null int64
MTM_contract_cnt          73583 non-null int64
T_Mobile_port_cnt         73583 non-null int64
Sprint_port_cnt           73583 non-null int64
LocalRetailer_chnl_cnt    73583 non-null int64
TwoYear_contract_cnt      73583 non-null int64
BasicDev_cnt              73583 non-null int64
arpu_amt                  73583 non-null float64
gl_late_fees_amt          73583 non-null float64
gl_nonrecur_rev_amt       73583 non-null float64
pplan_access_acrtn        73583 non-null float64
tot_revenue_acrtn         73583 non-null float64
access_rev_acrtn          73583 non-null float64
eqp_rev_acrtn             73583 non-null float64
usage_rev_acrtn           73583 non-null float64
data_overage_amt          73583 non-null float64
voice_overage_amt         73583 non-null float64
Customer_Cluster          73583 non-null object
dtypes: float64(11), int64(35), object(1)
memory usage: 29.4+ MB

Distribution of Output Variable:

In [63]:
plt.figure(figsize=(20,7))
plt.subplot(121)
df["Customer_Cluster"].value_counts().plot.pie(autopct = "%1.0f%%",colors = sns.color_palette("prism",4),startangle = 60,labels=["Silver","Bronze","Gold","Platinum"],
                                                                        wedgeprops={"linewidth":2,"edgecolor":"k"},explode=[0,0.1,0.2,0.3],shadow =True)
plt.title("Distribution of Target Variable")

plt.subplot(122)
ax = df["Customer_Cluster"].value_counts().plot(kind="barh")

for i,j in enumerate(df["Customer_Cluster"].value_counts().values):
    ax.text(.7,i,j,weight = "bold",fontsize=20)

plt.title("Count of Cluster")
plt.show()
In [64]:
df.columns
Out[64]:
Index(['cust_status_ind', 'recency', 'frequency', 'monetary',
       'NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt', 'billed_acct_size',
       'active_loan_cnt', 'birth_year', 'BeyondUnlmtd_plan_cnt',
       'GoUnlimit_plan_cnt', 'IphoneDev_cnt', 'recurring_paychnl_cnt',
       'CA_state_cd', 'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt',
       'Edge_contract_cnt', 'CommunicStore_chnl_cnt', 'AndroidDev_cnt',
       'WearablDev_cnt', 'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt',
       'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt',
       'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt',
       'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt',
       'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt',
       'BasicDev_cnt', 'arpu_amt', 'gl_late_fees_amt', 'gl_nonrecur_rev_amt',
       'pplan_access_acrtn', 'tot_revenue_acrtn', 'access_rev_acrtn',
       'eqp_rev_acrtn', 'usage_rev_acrtn', 'data_overage_amt',
       'voice_overage_amt', 'Customer_Cluster'],
      dtype='object')

Since we have already removed highly correlated features before performing Clustering, there is no need to apply correlation check again.

Classification for 2019 Customers (New customers):

In [65]:
from sklearn.model_selection import train_test_split

X = df[['NewVZPlan_plan_cnt', 'VerizonUNL_plan_cnt', 'billed_acct_size',
       'active_loan_cnt', 'birth_year', 'BeyondUnlmtd_plan_cnt',
       'GoUnlimit_plan_cnt', 'IphoneDev_cnt', 'recurring_paychnl_cnt',
       'CA_state_cd', 'Hispanic_ethnicity_cnt', 'data_ovrg_Yes_cnt',
       'Edge_contract_cnt', 'CommunicStore_chnl_cnt', 'AndroidDev_cnt',
       'WearablDev_cnt', 'SingleLine_plan_cnt', 'TX_state_cd', 'ATT_port_cnt',
       'DataOnly_Revtype_cnt', 'MatureAdults_cnt', 'handset_paychnl_cnt',
       'VoiceOnly_pplangrp_cnt', 'SngleCpleKids_cnt', 'FamTeens_cnt',
       'Purch_equip_cnt', 'MTM_contract_cnt', 'T_Mobile_port_cnt',
       'Sprint_port_cnt', 'LocalRetailer_chnl_cnt', 'TwoYear_contract_cnt',
       'BasicDev_cnt', 'arpu_amt','data_overage_amt','voice_overage_amt']]
y = df['Customer_Cluster']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
In [66]:
X_train.head()
Out[66]:
NewVZPlan_plan_cnt VerizonUNL_plan_cnt billed_acct_size active_loan_cnt birth_year BeyondUnlmtd_plan_cnt GoUnlimit_plan_cnt IphoneDev_cnt recurring_paychnl_cnt CA_state_cd Hispanic_ethnicity_cnt data_ovrg_Yes_cnt Edge_contract_cnt CommunicStore_chnl_cnt AndroidDev_cnt WearablDev_cnt SingleLine_plan_cnt TX_state_cd ATT_port_cnt DataOnly_Revtype_cnt MatureAdults_cnt handset_paychnl_cnt VoiceOnly_pplangrp_cnt SngleCpleKids_cnt FamTeens_cnt Purch_equip_cnt MTM_contract_cnt T_Mobile_port_cnt Sprint_port_cnt LocalRetailer_chnl_cnt TwoYear_contract_cnt BasicDev_cnt arpu_amt data_overage_amt voice_overage_amt
460760 0 0 5 20 1988 4 0 5 5 5 5 0 4 5 0 0 0 0 0 2 0 5 0 5 0 1 2 4 0 0 0 0 362.66 0.0 0.0
337749 1 1 1 1 1993 0 0 1 1 0 0 0 1 1 0 0 0 1 1 0 0 1 0 0 1 0 1 0 0 0 0 0 130.91 0.0 0.0
702025 2 0 2 4 1999 2 2 1 2 0 0 0 2 1 0 1 0 0 0 2 0 2 0 0 0 0 1 0 1 0 0 0 115.81 0.0 0.0
454031 0 0 1 1 1946 0 0 0 0 0 0 0 1 1 0 0 1 0 0 0 1 0 1 0 0 0 1 0 0 0 0 1 33.43 0.0 0.0
136394 0 0 1 0 1993 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 0 0 33.33 0.0 0.0

Standard Scaler for Feature Scaling

In [67]:
# Feature Scaling:

from sklearn.preprocessing import StandardScaler

sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

Decision Tree Classifier

In [68]:
from sklearn.metrics import roc_curve, precision_recall_curve, auc, make_scorer, recall_score, accuracy_score, classification_report, precision_score, confusion_matrix
In [69]:
# ---- Decision Tree -----------
from sklearn import tree


dtree = tree.DecisionTreeClassifier(criterion='entropy', max_depth=5)
dtree.fit(X_train, y_train)
y_pred_dtree = dtree.predict(X_test) 
print('Accuracy of decision tree on training set: {:.2f}'
     .format(dtree.score(X_train, y_train)))
print('Accuracy of decision tree  on validation set: {:.2f}'
     .format(dtree.score(X_test, y_test)))
print('Clasification report of Decision Treeclassifier on validation set: \n{}'
     .format(classification_report(y_test, y_pred_dtree)))
print(confusion_matrix(y_test, y_pred_dtree))
Accuracy of decision tree on training set: 0.74
Accuracy of decision tree  on validation set: 0.74
Clasification report of Decision Treeclassifier on validation set: 
              precision    recall  f1-score   support

      Bronze       0.81      0.69      0.75      3931
        Gold       0.66      0.71      0.69      3243
    Platinum       0.69      0.81      0.75      1828
      Silver       0.76      0.76      0.76      5715

   micro avg       0.74      0.74      0.74     14717
   macro avg       0.73      0.74      0.74     14717
weighted avg       0.74      0.74      0.74     14717

[[2723  217  161  830]
 [  12 2299  356  576]
 [   0  341 1487    0]
 [ 616  609  138 4352]]

Random Forest Classifier

In [70]:
# ---- Random Forest -----------
from sklearn.ensemble import RandomForestClassifier

rfc = RandomForestClassifier(n_estimators=100)
rfc.fit(X_train, y_train)
y_pred_rfc = rfc.predict(X_test) 

print('Accuracy of Random Forest on training set: {:.2f}'
     .format(rfc.score(X_train, y_train)))
print('Accuracy of Random Forest  on validation set: {:.2f}'
     .format(rfc.score(X_test, y_test)))
print('Clasification report of Random Forest on validation set: \n{}'
     .format(classification_report(y_test, y_pred_rfc)))
print(confusion_matrix(y_test, y_pred_rfc))
Accuracy of Random Forest on training set: 1.00
Accuracy of Random Forest  on validation set: 0.79
Clasification report of Random Forest on validation set: 
              precision    recall  f1-score   support

      Bronze       0.86      0.76      0.81      3931
        Gold       0.70      0.77      0.73      3243
    Platinum       0.77      0.80      0.79      1828
      Silver       0.80      0.81      0.80      5715

   micro avg       0.79      0.79      0.79     14717
   macro avg       0.78      0.78      0.78     14717
weighted avg       0.79      0.79      0.79     14717

[[2977  145   51  758]
 [  43 2481  292  427]
 [  16  340 1468    4]
 [ 428  563   95 4629]]

Finding the best parameters for Random forest:

In [71]:
# Number of features to consider at every split
max_features = ['auto', 'sqrt']
# Maximum number of levels in tree
max_depth = [int(x) for x in np.linspace(10, 80, num = 11)]
max_depth.append(None)
# Minimum number of samples required to split a node
min_samples_split = [2, 5, 10]
# Minimum number of samples required at each leaf node
min_samples_leaf = [1, 2, 4]
# Method of selecting samples for training each tree
bootstrap = [True, False]

random_grid = {
               'max_features': max_features,
               'max_depth': max_depth,
               'min_samples_split': min_samples_split,
               'min_samples_leaf': min_samples_leaf,
               'bootstrap': bootstrap}
print(random_grid)
{'max_features': ['auto', 'sqrt'], 'max_depth': [10, 17, 24, 31, 38, 45, 52, 59, 66, 73, 80, None], 'min_samples_split': [2, 5, 10], 'min_samples_leaf': [1, 2, 4], 'bootstrap': [True, False]}
In [72]:
from sklearn.model_selection import RandomizedSearchCV
# Use the random grid to search for best hyperparameters
# First create the base model to tune
rf = RandomForestClassifier()
# Random search of parameters, using 3 fold cross validation, 
# search across 100 different combinations, and use all available cores
rf_random = RandomizedSearchCV(estimator = rf, param_distributions = random_grid, n_iter = 10, cv = 3, verbose=2, random_state=42, n_jobs = -1)
# Fit the random search model
rf_random.fit(X_train, y_train)
y_pred_rf = rf_random.predict(X_test)
Fitting 3 folds for each of 10 candidates, totalling 30 fits
[Parallel(n_jobs=-1)]: Using backend LokyBackend with 4 concurrent workers.
[Parallel(n_jobs=-1)]: Done  30 out of  30 | elapsed:   22.2s finished
In [73]:
rf_random.best_params_
Out[73]:
{'min_samples_split': 10,
 'min_samples_leaf': 2,
 'max_features': 'sqrt',
 'max_depth': 73,
 'bootstrap': False}

Random Forest tuned for Hyper Parameters

In [76]:
# ---- Random Forest with best parameters -----------
from sklearn.ensemble import RandomForestClassifier

rfc = RandomForestClassifier(n_estimators=10,min_samples_split= 10,min_samples_leaf= 2,max_features='sqrt',max_depth=73,bootstrap=False)
rfc.fit(X_train, y_train)
y_pred_rfc = rfc.predict(X_test)
In [77]:
print('Accuracy of Random Forest on training set: {:.2f}'
     .format(rfc.score(X_train, y_train)))
print('Accuracy of Random Forest  on validation set: {:.2f}'
     .format(rfc.score(X_test, y_test)))
print('Classification report of Random Forest on validation set: \n{}'
     .format(classification_report(y_test, y_pred_rfc)))
print(confusion_matrix(y_test, y_pred_rfc))
Accuracy of Random Forest on training set: 0.94
Accuracy of Random Forest  on validation set: 0.78
Classification report of Random Forest on validation set: 
              precision    recall  f1-score   support

      Bronze       0.85      0.76      0.80      3931
        Gold       0.70      0.75      0.73      3243
    Platinum       0.77      0.79      0.78      1828
      Silver       0.79      0.81      0.80      5715

   micro avg       0.78      0.78      0.78     14717
   macro avg       0.78      0.78      0.78     14717
weighted avg       0.78      0.78      0.78     14717

[[2977  138   51  765]
 [  45 2445  291  462]
 [  18  355 1446    9]
 [ 458  545  102 4610]]

XG Boost Classifier

In [78]:
#XGBoost Multiclassification Model
from xgboost import XGBClassifier

xgb = XGBClassifier()
xgb.fit(X_train, y_train)
y_pred_xgb = xgb.predict(X_test) 

print('Accuracy of XGBoost on training set: {:.2f}'
     .format(xgb.score(X_train, y_train)))
print('Accuracy of XGBoost  on validation set: {:.2f}'
     .format(xgb.score(X_test, y_test)))
print('Classification report of XGBoost on validation set: \n{}'
     .format(classification_report(y_test, y_pred_xgb)))
print(confusion_matrix(y_test, y_pred_xgb))
Accuracy of XGBoost on training set: 0.78
Accuracy of XGBoost  on validation set: 0.78
Classification report of XGBoost on validation set: 
              precision    recall  f1-score   support

      Bronze       0.86      0.71      0.78      3931
        Gold       0.71      0.76      0.73      3243
    Platinum       0.76      0.82      0.79      1828
      Silver       0.78      0.82      0.80      5715

   micro avg       0.78      0.78      0.78     14717
   macro avg       0.78      0.78      0.78     14717
weighted avg       0.78      0.78      0.78     14717

[[2804  169   80  878]
 [  40 2459  283  461]
 [  19  306 1502    1]
 [ 397  526  104 4688]]

Naive Bayes Classifier

In [79]:
# ---- Naive Bayes -----------
from sklearn.naive_bayes import GaussianNB

gnb = GaussianNB()
gnb.fit(X_train, y_train)
y_pred_gnb = gnb.predict(X_test) 
print('Accuracy of GNB classifier on training set: {:.2f}'
     .format(gnb.score(X_train, y_train)))
print('Accuracy of GNB classifier on validation set: {:.2f}'
     .format(gnb.score(X_test, y_test)))
print('Classification report of Gaussian Naive Bayes on validation set: \n{}'
     .format(classification_report(y_test, y_pred_gnb)))
print(confusion_matrix(y_test, y_pred_gnb))
Accuracy of GNB classifier on training set: 0.58
Accuracy of GNB classifier on validation set: 0.58
Classification report of Gaussian Naive Bayes on validation set: 
              precision    recall  f1-score   support

      Bronze       0.84      0.36      0.50      3931
        Gold       0.48      0.40      0.44      3243
    Platinum       0.57      0.61      0.59      1828
      Silver       0.57      0.84      0.68      5715

   micro avg       0.58      0.58      0.58     14717
   macro avg       0.61      0.55      0.55     14717
weighted avg       0.62      0.58      0.57     14717

[[1405  284  154 2088]
 [  39 1294  500 1410]
 [   8  594 1115  111]
 [ 228  522  184 4781]]

Ada Boost Classifier

In [80]:
# ---- Ada Boost -----------
from sklearn.ensemble import AdaBoostClassifier
abc = AdaBoostClassifier(random_state=1)
abc.fit(X_train, y_train)
y_pred_abc = abc.predict(X_test) 

print('Accuracy of AdaBoost on training set: {:.2f}'
     .format(abc.score(X_train, y_train)))
print('Accuracy of AdaBoost  on validation set: {:.2f}'
     .format(abc.score(X_test, y_test)))
print('Classification report of Ada Boost on validation set: \n{}'
     .format(classification_report(y_test, y_pred_abc)))
print(confusion_matrix(y_test, y_pred_abc))
Accuracy of AdaBoost on training set: 0.70
Accuracy of AdaBoost  on validation set: 0.69
Classification report of Ada Boost on validation set: 
              precision    recall  f1-score   support

      Bronze       0.76      0.52      0.62      3931
        Gold       0.66      0.68      0.67      3243
    Platinum       0.72      0.82      0.77      1828
      Silver       0.67      0.78      0.72      5715

   micro avg       0.69      0.69      0.69     14717
   macro avg       0.70      0.70      0.69     14717
weighted avg       0.70      0.69      0.69     14717

[[2027  147   64 1693]
 [ 115 2202  425  501]
 [  27  284 1508    9]
 [ 487  696  102 4430]]

Confusion Matrix for the Best Model

In [81]:
# Creates a confusion matrix
cm = confusion_matrix(y_test, y_pred_rfc) 

# Transform to df for easier plotting
cm_df = pd.DataFrame(cm,
                     index = ['Platinum','Gold','Silver','Bronze'], 
                     columns = ['Platinum','Gold','Silver','Bronze'])

plt.figure(figsize=(8,6))
sns.heatmap(cm_df, annot=True,fmt='2.0f')
plt.title('Random Forest \nAccuracy:{0:.3f}'.format(accuracy_score(y_test, y_pred_rfc)))
plt.ylabel('True label')
plt.xlabel('Predicted label')
plt.show()

Shap Feature Importance: (Model Explainability)

We now try to obtain the features that contribute the most to our Random forest model. We can use Shap feature explanation for the same.

In [82]:
import shap
pd.set_option("display.max_columns", 96)
pd.set_option("display.max_rows", 96)

plt.rcParams['figure.figsize'] = (12, 9)
plt.style.use('ggplot')

shap.initjs()
In [83]:
# DF, based on which importance is checked
X_importance = X_test

# Explain model predictions using shap library:
explainer = shap.TreeExplainer(rfc)
shap_values = explainer.shap_values(X_importance)
In [95]:
# Plot summary_plot
shap.summary_plot(shap_values, X)

Improving the Model Accuracy: (Grid Search)

In [98]:
grid_param = {
    'n_estimators': [100, 200, 300],
    'criterion': ['gini', 'entropy'],
    'bootstrap': [True, False]
}
In [100]:
from sklearn.model_selection import GridSearchCV

gd_sr = GridSearchCV(estimator=rfc,
                     param_grid=grid_param,
                     scoring='accuracy',
                     cv=5,
                     n_jobs=-1)
In [101]:
gd_sr.fit(X_train, y_train)
Out[101]:
GridSearchCV(cv=5, error_score='raise-deprecating',
       estimator=RandomForestClassifier(bootstrap=False, class_weight=None, criterion='gini',
            max_depth=73, max_features='sqrt', max_leaf_nodes=None,
            min_impurity_decrease=0.0, min_impurity_split=None,
            min_samples_leaf=2, min_samples_split=10,
            min_weight_fraction_leaf=0.0, n_estimators=10, n_jobs=None,
            oob_score=False, random_state=None, verbose=0,
            warm_start=False),
       fit_params=None, iid='warn', n_jobs=-1,
       param_grid={'n_estimators': [100, 200, 300], 'criterion': ['gini', 'entropy'], 'bootstrap': [True, False]},
       pre_dispatch='2*n_jobs', refit=True, return_train_score='warn',
       scoring='accuracy', verbose=0)

The Best Estimators Obtained for the model after Grid Search CV are:

In [102]:
best_parameters = gd_sr.best_params_
print(best_parameters)
{'bootstrap': False, 'criterion': 'gini', 'n_estimators': 300}

Results of Best Estimator:

In [103]:
best_result = gd_sr.best_score_
print(best_result)
0.7871946454659736

The accuracy we have obtained after tuning the best model is 78.71%. We can further improve the performance of this model by stacking. We can combaine Random Forest and XGBoost algorithms to make better predictions!

Stacking

In [108]:
from mlxtend.classifier import StackingClassifier
from mlxtend.plotting import plot_learning_curves
from mlxtend.plotting import plot_decision_regions
from sklearn import model_selection
In [109]:
clf1 = RandomForestClassifier(random_state=0,n_estimators=300, max_depth=73, criterion = 'gini')
clf2 = XGBClassifier()
clf3 = GaussianNB()
rfc = RandomForestClassifier(n_estimators=100)
sclf = StackingClassifier(classifiers=[clf1, clf2, clf3], 
                          meta_classifier=rfc)
In [113]:
# Import label encoder 
from sklearn import preprocessing 
  
# label_encoder object knows how to understand word labels. 
label_encoder = preprocessing.LabelEncoder() 
  
# Encode labels in column 'species'. 
y_train = label_encoder.fit_transform(y_train) 
y_test = label_encoder.fit_transform(y_test) 
In [115]:
sclf = StackingClassifier(classifiers=[clf1, clf2, clf3], 
                          meta_classifier=rfc) 
sclf.fit(X_train, y_train)
y_pred_sclf = sclf.predict(X_test) 

print('Accuracy of Stacking classifier on training set: {:.2f}'
     .format(sclf.score(X_train, y_train)))
print('Accuracy of Stacked Classifier on test set: {:.2f}'
     .format(sclf.score(X_test, y_test)))
print('Clasification report of Stacked Classifier on Test set: \n{}'
     .format(classification_report(y_test, y_pred_sclf)))
print(confusion_matrix(y_test, y_pred_sclf))
Accuracy of Stacking classifier on training set: 1.00
Accuracy of Stacked Classifier on test set: 0.79
Clasification report of Stacked Classifier on Test set: 
              precision    recall  f1-score   support

           0       0.86      0.76      0.80      3931
           1       0.70      0.77      0.74      3243
           2       0.77      0.80      0.79      1828
           3       0.80      0.81      0.80      5715

   micro avg       0.79      0.79      0.79     14717
   macro avg       0.78      0.78      0.78     14717
weighted avg       0.79      0.79      0.79     14717

[[2969  151   51  760]
 [  44 2491  281  427]
 [  17  340 1467    4]
 [ 436  552  100 4627]]

The Stacking Classifier is overfitting the training set and is not able to predict the test set perfectly as it has only 79% accuracy. Even though we have an improved accuracy by 1%, we reject this model due to overfitting!

This concludes that the best accuracy that can be obtained is 79% from Random Forest (With tuned Parameters). To further improve the accuracy of the model, we have to feed more rich data to the input.